Modification of plant lignin content

ABSTRACT

DNA constructs comprising a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment can be used to reduce or modulate the lignin content in plants. In some embodiments, DNA constructs comprise at least a portion of a gene for 4CL, C3H, CCR, C4H or CCoAOMT. Vascular-preferred and constitutive promoters can be used to drive expression of the constructs.

This application is a divisional of U.S. patent application Ser. No. 10/946,650, filed on Sep. 22, 2004.

FIELD OF INVENTION

The invention relates to genetically modifying plants, especially trees, through manipulation of the lignin biosynthesis pathway, and more particularly, to genetically modifying plants through the down regulation of 4CL, C3H, CCR, C4H or CCoAOMT to achieve altered lignin content.

BACKGROUND OF THE INVENTION

Lignin, a complex phenolic polymer, is a major component in cell walls of secondary xylem. In general, lignin constitutes 25% of the dry weight of the wood, making it the second most abundant organic compound on earth after cellulose. Although lignin contributes to the strength and rigidity of the stem, and protects microfibrils from physical, chemical and biological attack, it hinders the process of converting wood into paper. In order to liberate wood fibers for the production of paper, most of the lignin must be removed from the processed wood chips. Extracting lignin from wood fibers is a difficult and expensive process, involving harsh chemicals and yielding toxic waste products.

Consequently, practitioners have searched for more cost-effective and environmentally-friendly methods of reducing the lignin content in wood products. One alternative involves genetically modifying the biosynthetic pathway of lignin. For example, Chiang et al. have attempted to reduce the lignin content in a plant by genetically modifying the plant's monolignol biosynthetic pathway. See WO 02/20717. The method involved transforming a plant with multiple genes from the phenylpropanoid pathway, including key lignin control sites in the monolignol biosynthetic pathway such as the enzymes 4-coumarate-CoA ligase (4CL), coniferyl aldehyde 5-hydroxylase (CALD5H), S-adenosyl-L-methionine (SAM)-dependent 5-hydroxyconiferaldehyde O-methyltransferase (AldOMT), coniferyl alcohol dehydrogenase (CAD) and sinapyl alcohol dehydrogenase (SAD). Meanwhile, others have attempted to reduce lignin content by individually introducing copies of these genes into plant genomes. See e.g. WO 00/58489 (Scald); WO 99/24561 (4CL). Practitioners also have employed these genes in antisense strategies to modulate lignin biosynthesis. See e.g. WO 99/24561. While some of these methods successfully down-regulated lignin synthesis, the down-regulation of lignin can be detrimental to plant phenotype. Anterola et al., Phytochemistry, 61:221-294 (2002). Thus, improved methods for modulating lignin expression are needed.

A recent method of silencing gene expression at the mRNA level has emerged as a powerful alternative to prior technologies. RNA interference (RNAi) is a post-transcriptional process triggered by the introduction of double-stranded RNA (dsRNA) which leads to gene silencing in a sequence-specific manner. The initial discovery of RNA interference in C. elegans (Fire et al., Nature, 391:806-811 (1998) and U.S. Pat. No. 6,506,559) has been followed by numerous examples of organisms where introduction of dsRNA can induce the sequence-specific silencing effect. For example, RNAi has been reported to occur naturally in organisms as diverse as nematodes, trypanosomes, plants, fungi and animals. In nature, RNAi most likely serves to protect organisms from viruses, modulate transposon activity and eliminate aberrant transcription products.

Studies in the fruit fly Drosophila melanogaster suggest that RNAi is a two-step mechanism (Elbashir et al., Genes Dev., 15(2): 188-200 (2001)). First, long dsRNAs are cleaved by an enzyme known as Dicer into fragments of 21-23 ribonucleotides (nt), called small interfering RNAs (siRNAs). Then, siRNAs associate with a ribonuclease complex (termed RISC for RNA Induced Silencing Complex) and target this complex to complementary mRNAs. RISC then cleaves the targeted mRNAs opposite the complementary siRNA, which makes the mRNA susceptible to other RNA degradation pathways.

RNAi may offer an alternative to prior methods of controlling lignin synthesis. Before the potential can be realized, however, DNA constructs that can initiate RNAi processes in the context of lignin synthesis must be developed.

SUMMARY

In one embodiment, DNA constructs useful for modulating the expression of lignin-related genes are provided. In another embodiment, methods of modulating the expression of lignin in plants are provided. In addition, recombinant plants are produced that comprise DNA constructs useful for modulating the expression of lignin-related genes.

In one embodiment, a DNA construct comprises a promoter operably linked to a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct. In some embodiments, a gene in the monolignol biosynthetic pathway is selected from the group consisting of 4CL, C3H, CCR, C4H and CCoAOMT.

In another embodiment, a DNA construct comprises a promoter operably linked to a first DNA segment that corresponds to at least a portion of a 4-coumarate co-enzyme A ligase (4CL) gene, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct. Methods of modulating, inhibiting and/or reducing the expression of lignin in a plant comprising the use of such constructs also are provided.

In yet another embodiment, a method of inhibiting the expression of lignin in a plant cell comprises integrating into said plant cell's genome a construct comprising, in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a 4CL gene, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment, and growing said plant cell. Plants and plant cells produced by such processes also are provided, as are paper and wood products derived therefrom. Pulp and pulp-derived products derived from such transgenic plants also are provided. In another aspect, solid wood products derived from such transgenic plants are provided. The wood products include, for example, timber, lumber and composite.

In still another embodiment, plant cells are produced that comprise in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a 4CL gene, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment. The promoter, which is operably linked to the first DNA segment, can be endogenous or exogenous to the plant cell's genome. In other embodiments, plant cells are produced wherein the first DNA segment corresponds to at least a portion of a C3H, C4H, CCR or CCoAOMT gene.

In plants, a LIM protein has been demonstrated to control a number of genes in the lignin biosynthesis pathway, critically important for developing wood (Kawaoka A, Ebinuma H 2001 Transcriptional control of lignin biosynthesis by tobacco LIM protein. Phytochemistry 57:1149-1157; Kawaoka et al. Plant J. 22: 289-301 (2000)). Thus, in still another embodiment, plant cells are produced that comprise in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a LIM gene, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment.

In another embodiment, a method of making wood involves integrating into a plant cell's genome a DNA construct comprising, in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment, growing said plant cell and obtaining said wood.

In another aspect, a method of making wood pulp involves integrating into a plant cell's genome a DNA construct comprising, in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment, growing said plant cell and obtaining said wood pulp.

In yet another embodiment, a method of making paper involves integrating into a plant cell's genome a DNA construct comprising, in a 5′ to 3′ direction, a promoter, a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment and a second DNA segment that is complementary to the first DNA segment, growing said plant cell and obtaining said paper.

Other objects, features and advantages of the present invention will become apparent from the following detailed description. The detailed description and specific examples, while indicating preferred embodiments, are given for illustration only since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description. Further, the examples demonstrate the principle of the invention and cannot be expected to specifically illustrate the application of this invention to all the examples where it will be obviously useful to those skilled in the prior art.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1(A) (SEQ ID NOS: 5, 6, 7, 8, 15, 9, & 64 respectively in order of appearance) and FIG. 1(B) (SEQ ID NOS: 12 & 13) provide components for DNA constructs described herein.

FIG. 2(A) (SEQ ID NOS: 65, 25, and 26 respectively in order of appearance) and FIG. 2(B) (SEQ ID NOS: 34 & 33) provide components for DNA constructs described herein.

FIG. 3 provides a bar chart showing the resulting heights of transgenic Eucalyptus trees.

FIG. 4A provides a bar chart showing the resulting heights of transgenic Eucalyptus trees, while FIG. 4B depicts the mean lignin content of the transgenic trees.

FIG. 5 provides the nucleic acid sequence of the pine 4CL gene (SEQ ID NO: 66).

FIG. 6(A) (SEQ ID NOS: 18, 19, 20, & 21 respectively in order of appearance) and FIG. 6(B) (SEQ ID NOS: 22, 23, 67, & 48 respectively in order of appearance) identify the nucleic acid sequences of several pine 4CL fragments.

FIG. 7 provides two diagrams of the inventive DNA constructs. The upper diagram shows the general design for an inverted repeat of the gene of interest driven by the SuperUbiq promoter. The inverted repeat comprises a segment of the gene of interest (forward arrow), an intron from the yabby gene (SEQ ID 64) and the same segment of the gene of interest in the opposite orientation (back arrow). A transcriptional terminator completes the construct. The lower diagram shows the general design for an inverted repeat of the gene of interest driven by the Pine 4CL promoter. The inverted repeat comprises a segment of the gene of interest (forward arrow), an intron from the yabby gene (SEQ ID 64) and the same segment of the gene of interest in the opposite orientation (back arrow). A transcriptional terminator completes the construct.

FIG. 8 provides a schematic of several 4CL DNA constructs for use in modulating lignin in pine trees. The constructs use the general design as described in FIG. 7. The figure shows a series of constructs that use the SuperUbiq promoter and a selection of segments from the pine 4CL gene (SEQ ID 66). pWVC60 comprises fragment A (SEQ ID 18), pWVC62 comprises fragment B (SEQ ID 19), pWVK158 comprises fragment C (SEQ ID 20), pWVK154 comprises fragment D (SEQ ID 21), pWVK157 comprises fragment E (SEQ ID 22) and pWVK155 comprises fragment F (SEQ ID 23).

FIG. 9 provides a schematic of several 4CL DNA constructs for use in modulating lignin in pine trees. The constructs use the general design as described in FIG. 7. The figure shows a series of constructs that use the 4CL promoter and a selection of segments from the pine 4CL gene (SEQ ID 66). pWVK143 comprises fragment A (SEQ ID 18), pWVC46 comprises fragment C (SEQ ID 20), pWVC40 comprises fragment D (SEQ ID 21), pWVC43 comprises fragment E (SEQ ID 22) and pWVC44 comprises fragment F (SEQ ID 23).

FIG. 10 graphically demonstrates the modulation of lignin levels by 4CL RNAi constructs. Lignin values are the percent of lignin in the cell wall material as measured by NMR.

FIG. 11 is a plasmid map of the Eucalyptus 4CL construct pARB345.

FIG. 12 is a plasmid map of the Eucalyptus 4CL construct pARB339.

FIG. 13 is a plasmid map of the Eucalyptus 4CL construct pARB341.

FIG. 14 provides mass spectra of loblolly pine samples. 2000c=control; 1268b=transgenic tree comprising the DNA construct pARB585.

FIG. 15A is a scatter plot of PC1 scores versus PC2 scores of mass spectra collected using a mass range of m/z 50-200 for transgenic loblolly pine samples. FIG. 15B is a scatter plot highlighting the clustering of constructs pWVC41 and control.

FIG. 16A is a scatter plot highlighting the clustering of constructs pWVK154, pWVC40 and controls. FIG. 16B is a scatter plot highlighting the clustering of constructs pWVK158, pWVC46 and controls.

FIG. 17 provides mass spectra of loblolly pine samples from the constructs selected in FIG. 16A. The pyrolysis fragments assigned to the lignin peaks are shown above the control spectrum. The m/z value on the x-axis represents the ratio between the mass of a given ion and the number of elementary charges that it carries.

FIG. 18 provides ¹³C CP/MAS spectra of a line of transgenic loblolly pine transformed with pWVK154 and of an untransformed control. The spectra demonstrate a decrease in the aromatic and methoxyl carbons relative to the carbohydrate region (˜60-110 ppm) in the transgenic line as compared with the control line.

FIG. 19 is a scatter plot of NMR-determined lignin values and PLS-predicted lignin values determined by full cross validation of the PLS model using 2 principal components.

DETAILED DESCRIPTION

In one embodiment, DNA constructs can be used for suppressing the expression of targeted genes. The constructs and methods described herein can be used in individual cells in vitro or in vivo. In general, the constructs selectively suppress target genes by encoding double-stranded RNA (dsRNA) and initiating RNA interference (RNAi). In a preferred embodiment, the DNA constructs are used to reduce the lignin content in plants.

In one aspect, a DNA construct useful for modulating the lignin content of plants is provided. In one embodiment, a DNA construct comprises a promoter operably linked to a first DNA segment that corresponds to at least a portion of a 4-coumarate co-enzyme A ligase (4CL) gene, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct. Thus, when transcribed, the DNA constructs yield a RNA molecule comprising a first RNA segment corresponding to at least a portion of a 4CL gene, a spacer RNA segment and a second RNA segment that is complementary to the first RNA segment. Constructs comprising DNA segments for C3H, C4H, CCoAOMT and CCR operate in similar fashion.

While the mechanism by which the invention operates is not fully understood, and the inventors do not wish to limit their invention to any particular theory, it is believed that the first and second RNA segments of the resulting RNA molecule form a stem-loop. The dsRNA of the stem-loop likely is degraded into small interfering RNA (siRNA) of about 21-23 nucleotides in length. Then, siRNAs associate with a ribonuclease complex (termed RISC for RNA Induced Silencing Complex) and target this complex to complementary mRNAs. RISC then cleaves the targeted mRNAs opposite the complementary siRNA, making the mRNA susceptible to other RNA degradation pathways.
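By way of non-limiting illustration, the believed mechanism can be sketched in Python as a deliberately simplified model. The sketch assumes that the stem is cut into consecutive 21-nucleotide fragments and that a guide strand recognizes only a perfectly complementary site; neither assumption is asserted to reflect actual Dicer or RISC behavior. The sequences and the function names (dice, target_sites) are hypothetical and are not taken from the sequence listing.

```python
# Simplified, hypothetical model of siRNA production and target recognition.
# Assumption: consecutive 21-nt cuts and perfect pairing only.

PAIRS = str.maketrans("ACGU", "UGCA")


def rna_reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (A/C/G/U only)."""
    return rna.translate(PAIRS)[::-1]


def dice(stem_rna: str, size: int = 21) -> list:
    """Cut the stem into consecutive fragments of `size` nt; a shorter tail is dropped."""
    return [stem_rna[i:i + size] for i in range(0, len(stem_rna) - size + 1, size)]


def target_sites(mrna: str, guide: str) -> list:
    """Return the mRNA positions at which the guide strand can pair perfectly."""
    site = rna_reverse_complement(guide)
    return [i for i in range(len(mrna) - len(site) + 1) if mrna[i:i + len(site)] == site]


# Made-up 42-nt sense strand of a stem, and an mRNA that contains it.
stem_sense = "AUGGCUAACGGAUCCUUGACGAAGCUUCCGAUAGGCAUCGUA"
mrna = "GGG" + stem_sense + "AAA"

# The antisense strand of each fragment serves as the guide in this model.
guides = [rna_reverse_complement(fragment) for fragment in dice(stem_sense)]
for guide in guides:
    print(guide, target_sites(mrna, guide))
```

In this model each guide locates exactly one site on the hypothetical mRNA, which mirrors the sequence-specific character of the silencing described above.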

DEFINITIONS

The phrases “target gene” and “gene of interest” are used interchangeably herein. Target gene, as understood in the current context, is used to mean the gene that is pinpointed for modulation or suppression. The targeted gene may or may not contain regulatory elements such as, for example, a transcription factor binding site or enhancer. Genes that can be chosen for suppression include those that code for structural proteins, such as cell wall proteins, or for regulatory proteins such as transcription factors and receptors, as well as other functional genes. Furthermore, the term is meant to include not only the coding region of a polypeptide but also introns present in the DNA, regulatory elements, the promoter and the transcription terminator. Thus, “at least a portion of the target gene” is meant to include at least a portion of the transcribed sequence and/or at least a portion of the promoter and/or at least a portion of the terminator of the gene of interest.

DNA constructs described herein, at their most basic level, comprise a promoter, one or more DNA segments and a transcription terminator. As used herein, “DNA segment” is meant to refer to a deoxyribonucleic acid molecule comprised of at least several contiguous bases. The DNA segment that corresponds to the target gene may be 30 base pairs (bp) or greater in length, preferably at least 50 bp and less than 2000 bp, and more preferably at least 100 bp and less than 750 bp. The DNA segment can be single- or double-stranded. A DNA segment, within the context of the present invention, can include a gene or cDNA or a portion thereof, or it can include a promoter or a regulatory element or a portion thereof.
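The stated length ranges for a DNA segment corresponding to the target gene can be expressed, purely for illustration, as a small Python check. The tier labels and the function name segment_length_tier are hypothetical conveniences and do not appear elsewhere in this description.

```python
# Hypothetical helper that sorts a segment length into the ranges stated above.

def segment_length_tier(length_bp: int) -> str:
    """Classify a target-gene segment length (in base pairs)."""
    if length_bp < 30:
        return "below stated minimum"   # shorter than 30 bp
    if 100 <= length_bp < 750:
        return "more preferred"         # at least 100 bp and less than 750 bp
    if 50 <= length_bp < 2000:
        return "preferred"              # at least 50 bp and less than 2000 bp
    return "permitted"                  # 30 bp or greater, outside preferred ranges


for length in (20, 30, 189, 668, 1000, 2500):
    print(length, segment_length_tier(length))
```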

The term “RNA segment” refers to a ribonucleic acid molecule comprised of at least several contiguous bases. The RNA segment may be a transcript, i.e. an mRNA molecule that codes for an entire polypeptide, or it may be a portion thereof. Furthermore, the RNA segment need not code for a polypeptide or any portion thereof, as long as the segment meets the qualities of an RNA segment defined herein. For example, an RNA segment may comprise an intron, a 5′-UTR, or a 3′-UTR, which do not encode peptides. An RNA segment also is produced when a DNA segment comprising a promoter, a regulatory element, or a non-gene sequence is transcribed.

The term “spacer” refers to a series of contiguous nucleotides that separates two DNA or RNA segments. In one example, a “spacer DNA segment” codes for a “spacer RNA segment” that separates two RNA segments. The length of a spacer may vary over a wide range, from 10 base pairs (bp) to 2000 bp or more. When very long complementary segments of DNA are separated by a short spacer, the construct may be unstable. Therefore, the spacer preferably should be between ¼ to 2 times the length of the segments it is separating. For example, if complementary DNA segments of 160 bp are present, the spacer segment between them would preferably be between 40 to 320 bp. The spacer may encode an intron that is spliced out of the transcript so that the resulting spacer RNA is much shorter than the complementary DNA segments of the transcript.
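The preferred relationship between spacer length and segment length can be illustrated with a short Python sketch. The function names are hypothetical, and the sketch applies only the ¼-to-2-times preference to the spacer DNA segment; it does not model intron splicing.

```python
# Hypothetical helpers applying the preferred spacer-to-segment length ratio.

def preferred_spacer_range(segment_len_bp: int) -> tuple:
    """Return (minimum, maximum) preferred spacer length in bp."""
    return segment_len_bp / 4, segment_len_bp * 2


def spacer_is_preferred(spacer_len_bp: int, segment_len_bp: int) -> bool:
    """True when the spacer is between 1/4 and 2 times the segment length."""
    low, high = preferred_spacer_range(segment_len_bp)
    return low <= spacer_len_bp <= high


# Complementary segments of 160 bp give a preferred spacer of 40 to 320 bp.
print(preferred_spacer_range(160))
print(spacer_is_preferred(100, 160))
print(spacer_is_preferred(20, 160))
```

For complementary segments of 160 bp the sketch reproduces the 40 to 320 bp range given in the example above.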

“Complementary” RNA or DNA segments are segments that will specifically bind to each other. Preferably, the sequence of two complementary segments should be at least 80% complementary to each other. More preferably, the complementarity should be at least 85%, 90%, 95%, 96%, 97%, 98%, 99% or even 100%. The DNA segments that are complementary to each other may be 30 base pairs (bp) or greater in length, preferably at least 50 bp and less than 2000 bp, and more preferably at least 100 bp and less than 750 bp.

By 95% complementarity, for example, it is meant that nucleotides of the complementary RNA or DNA segments will bind to each other in an exact base-to-base manner, except that one RNA or DNA segment may contain up to 5 point mutations per 100 bases of the other complementary strand of the RNA or DNA segment. The point mutations may be in the form of a deleted base or a substituted base. Furthermore, these mutations of the reference sequence may occur at the 5′ or 3′ terminal positions of one of the complementary nucleotide sequences or anywhere between the terminal positions, interspersed either individually among nucleotides in the reference sequence or in one or more contiguous groups within the reference sequence.
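As a non-limiting illustration of this definition, the following Python sketch computes percent complementarity for two segments of equal length. It is an assumption-laden simplification: only substituted bases are handled, because deleted bases would require an alignment step, and the 100-base test sequence is invented for the example.

```python
# Hypothetical sketch: percent complementarity for equal-length DNA segments,
# counting substitutions only (deletions would need an alignment step).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return dna.translate(COMPLEMENT)[::-1]


def percent_complementarity(first: str, second: str) -> float:
    """Percentage of positions at which `second` pairs with `first` in antiparallel fashion."""
    if len(first) != len(second):
        raise ValueError("this sketch handles equal-length segments only")
    expected = reverse_complement(first)
    matches = sum(a == b for a, b in zip(expected, second))
    return 100.0 * matches / len(first)


first = "ACGTTGCAAG" * 10                # made-up 100-base segment
perfect = reverse_complement(first)      # fully complementary partner

# Introduce 5 substitutions per 100 bases into the partner segment.
mutated = list(perfect)
for i in (0, 20, 40, 60, 80):
    mutated[i] = "A" if mutated[i] != "A" else "C"
second = "".join(mutated)

print(percent_complementarity(first, perfect))
print(percent_complementarity(first, second))
```

Five substituted bases in a 100-base partner segment yield 95% complementarity in this sketch, consistent with the definition above.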

As a practical matter, percent complementarity, as well as identity, can be determined, for example, by comparing sequence information using the GAP computer program, version 6.0, available from the University of Wisconsin Genetics Computer Group (UWGCG). The GAP program utilizes the alignment method of Needleman and Wunsch (1970). Briefly, the GAP program defines similarity as the number of aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by the total number of symbols in the shorter of the two sequences. The preferred default parameters for the GAP program include: (1) a unary comparison matrix (containing a value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted comparison matrix of Gribskov and Burgess (1986), (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps. Alternatively, percent complementarity can be assessed using the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2:482-489 (1981), to find the best segment of homology between two sequences. When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed.
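The arithmetic behind the similarity measure and the gap penalties described above can be sketched in Python. The sketch is not the GAP or Bestfit program and performs no alignment; it assumes the two sequences have already been aligned, with "-" marking gap positions, and it assumes the 0.10 penalty applies to every symbol in a gap, including the first. The function names are hypothetical.

```python
# Hypothetical sketch of the scoring arithmetic only; the input rows are assumed
# to be pre-aligned, equal-length strings with "-" marking gap positions.
import re


def gap_similarity(aligned_a: str, aligned_b: str) -> float:
    """Identical aligned symbols divided by the length of the shorter sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned rows must have equal length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    shorter = min(len(aligned_a.replace("-", "")), len(aligned_b.replace("-", "")))
    return matches / shorter


def gap_penalty(aligned: str, per_gap: float = 3.0, per_symbol: float = 0.10) -> float:
    """Total penalty for internal gaps in one aligned row; end gaps are not penalized."""
    internal = aligned.strip("-")
    runs = re.findall(r"-+", internal)
    return sum(per_gap + per_symbol * len(run) for run in runs)


row_a = "ACGTACGTAC"
row_b = "ACG--CGTAC"   # one internal gap spanning two symbols
print(gap_similarity(row_a, row_b))
print(gap_penalty(row_b))
```

In this example all eight symbols of the shorter sequence are matched, and the single two-symbol gap incurs one per-gap penalty plus two per-symbol penalties.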

Two DNA segments that have similar or identical sequences on opposite DNA strands are referred to as “inverted repeats.” Transcription through a region with inverted DNA repeats produces RNA segments that are “complementary” to each other. A transcript that comprises two complementary segments of RNA can form a single RNA molecule with double-stranded regions. Such double-stranded regions are sometimes called “stem-loops” or “hairpins.”
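For illustration only, the following Python sketch assembles an inverted repeat from a first segment and a spacer and checks that the resulting transcript can fold back on itself. The sequences are short made-up stand-ins, not sequences from the sequence listing, and the function names are hypothetical. The transcript is modeled simply as the same-sense strand with U in place of T.

```python
# Hypothetical sketch: build an inverted repeat and test for a possible stem.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return dna.translate(COMPLEMENT)[::-1]


def build_inverted_repeat(first_segment: str, spacer: str) -> str:
    """Join first segment, spacer and complementary second segment, 5' to 3'."""
    second_segment = reverse_complement(first_segment)
    return first_segment + spacer + second_segment


def transcribe(dna: str) -> str:
    """Model the transcript as the same-sense strand with U in place of T."""
    return dna.upper().replace("T", "U")


def can_form_stem(transcript: str, stem_len: int) -> bool:
    """Check that the first and last `stem_len` bases can pair in antiparallel fashion."""
    pairs = {"A": "U", "U": "A", "G": "C", "C": "G"}
    five_prime = transcript[:stem_len]
    three_prime = transcript[-stem_len:][::-1]
    return all(pairs[a] == b for a, b in zip(five_prime, three_prime))


first = "ATGGCCAAGTTCGAC"    # made-up stand-in for a gene fragment
spacer = "GTAAGTTTTTCAG"     # made-up stand-in for an intron-derived spacer
insert = build_inverted_repeat(first, spacer)
rna = transcribe(insert)
print(insert)
print(can_form_stem(rna, len(first)))
```

The two ends of the modeled transcript pair base for base, leaving the spacer as the unpaired loop of the hairpin.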

By “transcription terminator” is meant a segment of DNA that encodes the 3′-end of an RNA transcript that causes RNA polymerase to halt or retard transcription. Because most eukaryotic mRNAs have poly(A) segments added to their 3′-ends, most transcription terminators specify a base or bases to which adenosyl residues are added. Thus, a transcription terminator can comprise DNA encoding at least a portion of the 3′-UTR of an mRNA immediately adjacent to and including the nucleotide(s) to which a poly(A) tail is added. A transcription terminator additionally can comprise at least a portion of the DNA sequence immediately after the site(s) of polyadenylation to provide a more complete DNA context for the transcription stop site. Transcription terminators also include segments that halt transcription other than terminators for polyadenylation such as transcription terminators for histone genes or ribosomal RNA genes.

DNA constructs, as used herein, also encompass vectors. The term “vector” refers to a DNA molecule capable of autonomous replication in a host cell. As known to those skilled in the art, a vector includes, but is not limited to, a plasmid, cosmid, phagemid, viral vectors, phage vectors, yeast vectors, mammalian vectors and the like. Typically, vectors will include a gene coding for a drug resistance marker, a thymidine kinase gene or a gene that complements an auxotroph. Various antibiotic resistance genes have been incorporated into vectors for the purpose of aiding selection of host cell clones containing such vectors. For example, antibiotic resistance genes incorporated into vectors intended for introduction into bacterial host cells include, but are not limited to, a gene that confers resistance to an antibiotic selected from the group consisting of ampicillin, kanamycin, tetracycline, neomycin, G418, blasticidin S and chloramphenicol. Genes for complementing an auxotroph are genes encoding enzymes or proteins which facilitate usage of nutritional or functional components by the host such as a purine, pyrimidine, amino acid (e.g., lysine, tryptophan, histidine, leucine, cysteine), or sphingolipid.

Additionally, vectors will include an origin of replication (replicons) for a particular host cell. For example, various prokaryotic replicons are known to those skilled in the art, and function to direct autonomous replication and maintenance of a recombinant molecule in a prokaryotic host cell.

The term “operably linked” refers to the chemical fusion, ligation, or synthesis of DNA such that a promoter-DNA sequence combination is formed in a proper orientation for the DNA sequence to be transcribed into an RNA segment. Transcription from the promoter-DNA sequence can be regulated by the promoter, possibly in combination with other regulatory elements. Alternatively, transcription from the promoter-DNA segment may not be regulated by the promoter. In the construction of the promoter-DNA sequence combination, it is generally preferred to position the promoter at a distance upstream from the initial codon of the DNA segment that is approximately the same as the distance between the promoter and the segment it controls in its natural setting. However, as known in the art, substantial variation in the distance can be accommodated without loss of promoter function.

The term “promoter” denotes a nucleotide sequence, natural or synthetic, capable of binding RNA polymerase to initiate transcription. Such promoters are known to those skilled in the art and may include bacterial, viral, fungal, plant, mammalian, or other eukaryotic promoters, the selection of which depends on the host cell or organism being transformed. It is expected that silencing of the target gene will be most effective when the suppressing construct is transcribed in the same tissue as the target gene. Although there is evidence that the silencing signal can be translocated to distant parts of a plant (e.g., Palauqui and Vaucheret, 1998, PNAS 95: 9675-9680), some cells may not be able to receive such a signal. For example, GFP expression at the very tip of the growing shoot was not silenced by a viral suppression construct (Dalmay et al., 2000, Plant Cell 12: 369-379). To achieve silencing of a gene expressed in many types of cells, a constitutive promoter of at least moderate strength is preferred. Examples of constitutive promoters that act in plants are viral promoters such as CaMV 35S or FiMV (Sanger et al., 1990, Plant Mol. Biol. 14: 433-443), bacterial promoters such as nopaline synthase (nos) or mannopine synthase (mas), or plant promoters such as those from the Arabidopsis ACTIN2 or UBIQUITIN10 genes (An et al., 1996, Plant J. 10: 107-121; Norris et al., 1993, Plant Mol. Biol. 21: 895-906). Target genes with limited expression patterns also can be silenced using a constitutive promoter to drive the suppression construct. However, it may be desirable to avoid expression of the suppression construct beyond what is necessary for the silenced phenotype. A promoter for the suppression construct could be used that has a pattern of expression similar to that of the target gene. For example, if silencing of a xylem-expressed target is planned, the promoter from the parsley 4CL gene (Hauffe et al., 1993, Plant J. 4: 235-253) could be used, or if a meristem-specific gene is targeted, the Arabidopsis PROLIFERA promoter (Springer et al., 1995, Science 268: 877-880) could be used. In one embodiment, the promoter is derived from a different species than the species being transformed, to avoid interactions between identical promoter sequences. Various other promoters for expression in eukaryotic cells are known in the art, including, but not limited to, viral or viral-like basal promoters like the SV40 late promoter and the RSV promoter, and fungal or mammalian cellular promoters (see, e.g., Larsen et al., 1995, Nucleic Acids Res. 23:1223-1230; Donis et al., 1993, BioTechniques 15:786-787; Donda et al., 1993, Mol. Cell. Endocrinol. 90:R23-26; and Huper et al., 1992, In Vitro Cell Dev. Biol. 28A:730-734). Various replicons are known to those skilled in the art that function in eukaryotic cells to direct replication and maintenance of a recombinant molecule, of which they are a part, in a eukaryotic host cell.

The term “regulatory element” refers to nucleic acid sequences that affect the specificity or efficiency of DNA transcription or mRNA translation including, but not limited to, binding sites for transcription factors, enhancers, and transcription or translation initiation and termination signals. Enhancer sequences are DNA elements that appear to increase transcriptional efficiency in a manner relatively independent of their position and orientation with respect to a nearby DNA segment. Thus, depending on the DNA construct, an enhancer may be placed either upstream or downstream from a particular DNA segment to increase transcriptional efficiency. Such regulatory elements may be inserted into construct DNA sequences using recombinant DNA methods known in the art. Other regulatory elements include, but are not limited to, a 5′ untranslated region (5′UTR) on the RNA segment as well as a 3′UTR (i.e., comprising the poly(A) tail) on the RNA segment, which are necessary for stability and efficient translation of the RNA segment or transcript.

As used herein, a “cassette” is a type of DNA construct comprising a promoter, a transcription terminator, and the DNA segments inserted between them. A cassette can be used to drive the expression of DNA or RNA segments in host cells or organisms in which the promoter is active.
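The ordering of elements in a cassette can be illustrated with a minimal Python data structure. The class and function names are hypothetical, and the element labels are descriptive placeholders modeled on the general design of FIG. 7 rather than actual sequences.

```python
# Hypothetical data structure listing cassette elements in 5' to 3' order.
from dataclasses import dataclass, field
from typing import List


@dataclass
class Cassette:
    promoter: str
    segments: List[str] = field(default_factory=list)
    terminator: str = "transcription terminator"

    def layout(self) -> List[str]:
        """Return the elements in 5' to 3' order: promoter, inserted segments, terminator."""
        return [self.promoter, *self.segments, self.terminator]


def hairpin_cassette(promoter: str, gene_fragment: str, spacer: str) -> Cassette:
    """Build a cassette whose insert is first segment, spacer, complementary segment."""
    segments = [
        f"{gene_fragment} (forward)",
        spacer,
        f"{gene_fragment} (opposite orientation)",
    ]
    return Cassette(promoter, segments)


example = hairpin_cassette("SuperUbiq promoter", "4CL fragment A", "yabby intron")
print(" -> ".join(example.layout()))
```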

The term “substantial sequence identity” describes the relatedness of two or more nucleotide sequences. Preferably, the sequences are at least 80% identical to each other, as calculated above. More preferably, the identity should be at least 85%, 90%, 95%, 96%, 97%, 98%, 99% or even 100%.

“About” will be understood by persons of ordinary skill in the art and will vary to some extent on the context in which the term is used. If there are uses of the term which are not clear to persons of ordinary skill in the art given the context in which it is used, “about” will mean up to plus or minus 10% of the particular term.

Discussion

In one aspect of the invention, DNA constructs are provided that are useful for modulating the lignin content in plants. In one embodiment, a DNA construct comprises a promoter operably linked to a first DNA segment that corresponds to at least a portion of a 4-coumarate co-enzyme A ligase (4CL) gene, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct.

A constitutive promoter, such as superubiquitin from P. radiata (U.S. Pat. No. 6,380,459, which is hereby incorporated by reference), can be used to drive the expression of the target 4CL or other lignin biosynthesis gene. In another embodiment, a DNA construct of the present invention comprises a promoter that directs expression specifically to the xylem. A promoter fragment isolated from the region upstream of the 4CL gene in P. taeda (U.S. Pat. No. 6,252,135, which is hereby incorporated by reference) is one example of a promoter that shows strong xylem-preferred expression. Experimental evidence described herein demonstrates that the use of a 4CL promoter in the inventive DNA constructs effectively reduces the lignin content while not adversely impacting plant height.

The first and second DNA segments of the inventive constructs can be derived from any 4CL gene. In a preferred embodiment, when modifying the lignin content in pine or eucalyptus trees, the first and second DNA segments are derived from the 4CL gene from Pinus radiata (pine) (U.S. Patent Application Publication 20030131373) or the 4CL gene from E. grandis (U.S. Pat. No. 6,410,718). Similarly, the first and second DNA segments of the inventive constructs can be derived from any portion of a 4CL gene. For example, fragments of about 50 bp, 100 bp, 200 bp, 400 bp, 600 bp or 1000 bp can be used. Other exemplary lengths shown herein include 189 bp, 327 bp, 334 bp, 373 bp, 389 bp and 668 bp. In preferred embodiments, the first DNA segment comprises a fragment selected from the sequences depicted in SEQ ID NOS: 18, 19, 20, 21, 22, 23, 24, 33 and 48.

The first DNA segment can be derived from either the sense strand or the antisense strand of a 4CL gene. As the second DNA segment is complementary to the first DNA segment and therefore derived from the opposing strand, the strand selection for the first DNA segment necessarily affects the source of the second DNA segment.

As noted above, a spacer DNA segment codes for a spacer RNA segment which serves to separate other RNA segments. A spacer RNA segment functions in the present invention as the loop in the stem-loop resulting from transcription of the DNA cassette of the inventive constructs. A spacer DNA segment can be completely synthetic or derived from a natural DNA sequence. In one embodiment, the spacer DNA segment is derived from an intron. Exemplary spacer DNA segments are shown in FIG. 1.
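The arrangement of first segment, spacer and complementary second segment can be sketched in a few lines. This is an illustration only, assuming that "complementary" means the second segment is the reverse complement of the first when both are read 5′ to 3′ on the same strand, so that the transcript can fold back on itself with the spacer as the loop. The function names and the toy sequences in the usage note are hypothetical and are not taken from the sequence listing.

```python
# Translation table for the four DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def build_hairpin_insert(first_segment: str, spacer: str) -> str:
    """Assemble first segment + spacer + second segment, 5' to 3'.

    Assumption: the second segment is the reverse complement of the first,
    so the two arms can base-pair and the spacer forms the loop.
    """
    second_segment = reverse_complement(first_segment)
    return first_segment + spacer + second_segment
```

For example, `build_hairpin_insert("ATGC", "GTAAGT")` places `ATGC` upstream of the spacer and its reverse complement `GCAT` downstream. Starting from the antisense strand instead simply swaps which arm comes first.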

Previously identified genes of interest, or portions or promoters thereof can be isolated using methods and techniques designed for the manipulation of nucleic acid molecules, which are well known in the art. For example, methods for the isolation, purification and cloning of nucleic acid molecules, as well as methods and techniques describing the use of eukaryotic and prokaryotic host cells and nucleic acid and protein expression therein, are described by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., 1989, and Current Protocols in Molecular Biology, Frederick M. Ausubel et al. Eds., John Wiley & Sons, Inc., 1987, the disclosure of which is hereby incorporated by reference.

The DNA constructs, including at least a portion of the gene or promoter of interest, can be introduced into host cells, which, as stated previously, can be individual cells, cells in culture, cells as part of a host organism, a fertilized oocyte or gametophyte or an embryonic cell. The term “introduced” refers to standard procedures known in the art for delivering recombinant vector DNA into a target host cell. Such procedures include, but are not limited to, transfection, infection, transformation, natural uptake, electroporation, biolistics and Agrobacterium. Agrobacterium has been used successfully in a variety of species including poplars (Leple, J. C. et al. 1992. Plant Cell Rep. 11: 137-141.), eucalyptus (Tournier, V. et al. 2003. Transgenic Res. 12: 403-411.) and pine (U.S. Pat. No. 6,518,485 (biolistics) and US published patent application 20020100083 (Agrobacterium) are the only published methods for successfully getting regenerated plants of transgenic loblolly pine), Norway spruce (Wenck, A. R. et al. 1999. Plant Mol. Biol. 39: 407-416.), rice (Hiei, Y. et al. 1997. Plant Mol. Biol. 35: 205-218.; Cheng, X. et al. 1998. Proc. Natl. Acad. Sci. USA. 95: 2767-2772.), wheat (Cheng, M. et al. 1997. Plant Physiol. 115: 971-980.) and maize (Ishida, Y. et al. 1996. Nat Biotechnol. 14: 745-750.). Transformation has been utilized in species such as barley (Tingay, S. et al. 1997. Plant J. 11: 1369-1376.), sugarcane (Arencibia, A. D. et al. 1998. Transgenic Research 7: 1-10; Enriquez-Obregon, G. A. et al. 1998. Planta 206: 20-27.), banana (May, G. D. et al. 1995. Bio/Technology 13: 486-492.), Asparagus officinalis (Delbreil, B. et al. 1993. Plant Cell Rep. 12: 129-132.) and Agapanthus praecox (Suzuki, S. et al. 2001. Plant Sci. 161: 89-97.).

The efficacy of DNA constructs in modulating lignin content can be measured in a variety of ways. For example, acetyl bromide lignin determinations can be carried out on extractive-free ground samples following the procedure used at the US Dairy Forage Research Center, Madison, Wis. (Fukushima, R. S. and Hatfield, R. D., J. Ag. Food Chem., 49(7):3133 (2001)). Pyrolysis molecular beam mass spectroscopy also can be used. The method consists of rapidly heating samples (0.1 g) in an inert helium atmosphere at 500° C. The generated pyrolysis products are sampled directly in real time by expanding through a sampling orifice with subsequent formation of the molecular beam, which provides rapid sample quenching and inhibits sample condensation. The mass spectrometer provides universal detection of all sampled products and the molecular beam sampling ensures that representative products from the original molecules are detected (Magrini et al., Environmental Pollution, 116: 255-268 (2002)). In another example, nuclear magnetic resonance (NMR) can be used to analyze lignin structure. NMR is an analytical method that can detect subatomic and structural information of molecules by measuring the absorption of radio-frequency electromagnetic radiation by nuclei under the influence of a magnetic field. Typically, 1H and 13C are the two main nuclei used to characterize underivatized lignin, following the method of Li, S. and K. Lundquist (Nordic Pulp and Paper Research J., 3. 191-195).

The reduction in lignin levels and the possible associated increase in CHO levels of trees can be both an economic and an environmental advantage for the pulp industry. The reduction of lignin in trees should lead to a reduction in the chemicals required to make pulp and possibly even a reduction in the amount of chemicals required to bleach the pulp.

The following examples serve to illustrate various embodiments of the present invention and should not be construed, in any way, to limit the scope of the invention.

EXAMPLES

Example 1 Construction of cDNA Libraries

To identify monolignol synthesis, monolignol transport, and lignin polymerization gene candidates in P. radiata and E. grandis databases, cDNA sequences were compared to public domain sequences (by SWISS-PROT/TrEMBL ID's) to search against the pine and eucalyptus databases (non-redundant by contig, expect <1.0e-2).
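The two filters named above, an expectation cutoff below 1.0e-2 and non-redundancy by contig, can be sketched as follows. This is a minimal illustration assuming the search results are available as simple records. The `Hit` type, its field names and the rule of keeping the lowest-expect hit per contig are assumptions and do not describe the software actually used.

```python
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One database hit (hypothetical record layout)."""
    query_id: str    # public-domain query sequence identifier
    contig_id: str   # pine or eucalyptus contig that was hit
    evalue: float    # expectation value reported for the hit


def filter_hits(hits: Iterable[Hit], max_evalue: float = 1.0e-2) -> List[Hit]:
    """Keep hits with expect < max_evalue, one (the best) per contig."""
    best = {}
    for hit in hits:
        if hit.evalue >= max_evalue:
            continue  # fails the expectation cutoff
        current = best.get(hit.contig_id)
        if current is None or hit.evalue < current.evalue:
            best[hit.contig_id] = hit
    # Report the surviving contigs from strongest to weakest hit.
    return sorted(best.values(), key=lambda h: h.evalue)
```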

The contig consensus DNA and protein sequences were then obtained for these hits, and duplicate sequences were identified. A multiple alignment was then carried out with the protein sequences. The protein alignment was created using the remaining pine and eucalyptus sequences along with the Arabidopsis members. From the protein alignment, a dendrogram was created. These sequences were analyzed by primer walking to provide a full length sequence (best HT pick from the contig analyzed for full length sequence).

The public domain monolignol synthesis, monolignol transport, and lignin polymerization gene sequences from maize, cotton, rice, and poplar were also extracted and blasted against the pine and eucalyptus databases. The completed primer walked pine and eucalyptus sequences were also blasted against ownseq and the top 500 hits were taken. This was done so that the sequences could be used to search further and ensure that nothing in the pine and eucalyptus databases had been missed by using the Arabidopsis superfamily. This search resulted in an additional 4 sequences which were not found in the previous searches. These sequences were then also sent for primer walked full length sequence.

After removing a small number of additional duplicates after primer walking, pine and eucalyptus primer walked monolignol synthesis, monolignol transport, and lignin polymerization superfamily members were identified. The classification of these sequences was confirmed by alignment with ClustalX, the corresponding dendrogram, and MEME/MAST analysis.

To identify additional sequence 5′ or 3′ of a partial cDNA sequence in a cDNA library, 5′ and 3′ rapid amplification of cDNA ends (RACE) was performed using the SMART RACE cDNA amplification kit (Clontech Laboratories, Palo Alto, Calif.). Generally, the method entailed first isolating poly(A) mRNA, performing first and second strand cDNA synthesis to generate double stranded cDNA, blunting cDNA ends, and then ligating the SMART RACE Adaptor to the cDNA to form a library of adaptor-ligated ds cDNA. Gene-specific primers were designed to be used along with adaptor specific primers for both 5′ and 3′ RACE reactions. Using 5′ and 3′ RACE reactions, 5′ and 3′ RACE fragments were obtained, sequenced, and cloned. The process may be repeated until the 5′ and 3′ ends of the full-length gene are identified. A full-length cDNA may be generated by end-to-end PCR using primers specific to the 5′ and 3′ ends of the gene.

For example, to amplify the missing 5′ region of a gene from first-strand cDNA, a primer was designed 5′→3′ from the opposite strand of the template sequence, and from the region between ~100-200 bp of the template sequence. A successful amplification should give an overlap of ~100 bp of DNA sequence between the 5′ end of the template and PCR product.
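The primer placement described above can be sketched as follows. This is a rough illustration assuming 0-based offsets into the known template, a hypothetical primer length of 25 nt, and a primer taken from the downstream edge of the 100-200 bp window. None of these choices is stated in the text, and real primer design would also weigh melting temperature and secondary structure.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_5prime_race_primer(
    template: str,
    region_start: int = 100,   # assumed 0-based start of the design window
    region_end: int = 200,     # assumed exclusive end of the design window
    primer_length: int = 25,   # hypothetical primer length
) -> str:
    """Gene-specific primer for 5' RACE, written 5' to 3'.

    The primer is the reverse complement of the last `primer_length` bases
    of the window, so it lies on the strand opposite the template and
    extends toward the missing 5' end.
    """
    if len(template) < region_end:
        raise ValueError("template is shorter than the design window")
    if primer_length > region_end - region_start:
        raise ValueError("primer does not fit inside the design window")
    window = template[region_end - primer_length:region_end]
    return reverse_complement(window)
```

Moving the primer toward `region_start` shortens the overlap between the known template and the PCR product, while moving it toward `region_end` lengthens it.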

RNA was extracted from four pine tissues, namely seedling, xylem, phloem and structural root, using the Concert Reagent Protocol (Invitrogen, Carlsbad, Calif.) and standard isolation and extraction procedures. The resulting RNA was then treated with DNase, using 10 U/μl DNase I (Roche Diagnostics, Basel, Switzerland). For 100 μg of RNA, 9 μl of 10× DNase buffer (Invitrogen, Carlsbad, Calif.), 10 μl of Roche DNase I and 90 μl of RNase-free water were used. The RNA was then incubated at room temperature for 15 minutes, and 1/10 volume of 25 mM EDTA was added. An RNeasy mini kit (Qiagen, Venlo, The Netherlands) was used for RNA purification according to the manufacturer's protocol.

To synthesize cDNA, the extracted RNA from xylem, phloem, seedling and root was used with the SMART RACE cDNA amplification kit (Clontech Laboratories Inc, Palo Alto, Calif.) according to the manufacturer's protocol. For the RACE PCR, the cDNA from the four tissue types was combined: the master mix for PCR was created by combining equal volumes of cDNA from xylem, phloem, root and seedling tissues. PCR reactions were performed in 96 well PCR plates, with 1 μl of primer from the primer dilution plate (10 mM) added to the corresponding well positions. 49 μl of master mix was aliquoted into the PCR plate with primers. Thermal cycling commenced on a GeneAmp 9700 (Applied Biosystems, Foster City, Calif.) at the following parameters:

94° C. (5 sec),

72° C. (3 min), 5 cycles;

94° C. (5 sec),

70° C. (10 sec),

72° C. (3 min), 5 cycles;

94° C. (5 sec),

68° C. (10 sec),

72° C. (3 min), 25 cycles.
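The three-stage program above can be written down as data to check the cycle count and the programmed hold time. This is only a bookkeeping sketch: the `Stage` type and function names are hypothetical, and ramp times between temperatures, which depend on the instrument, are not included.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class Stage:
    """One stage of a cycling program."""
    steps: Tuple[Tuple[int, int], ...]  # (temperature in deg C, hold in seconds)
    cycles: int


# The parameters listed above, stage by stage.
RACE_PROGRAM = (
    Stage(steps=((94, 5), (72, 180)), cycles=5),
    Stage(steps=((94, 5), (70, 10), (72, 180)), cycles=5),
    Stage(steps=((94, 5), (68, 10), (72, 180)), cycles=25),
)


def total_cycles(program: Iterable[Stage]) -> int:
    """Total number of cycles across all stages."""
    return sum(stage.cycles for stage in program)


def total_hold_seconds(program: Iterable[Stage]) -> int:
    """Sum of programmed hold times, excluding temperature ramps."""
    return sum(
        stage.cycles * sum(hold for _, hold in stage.steps) for stage in program
    )


print(total_cycles(RACE_PROGRAM))        # 35
print(total_hold_seconds(RACE_PROGRAM))  # 6775
```

The program therefore runs 35 cycles with 6775 s (about 113 minutes) of programmed holds. The annealing temperature steps down from 72° C. to 70° C. to 68° C. across the three stages.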

cDNA was separated on an agarose gel following standard procedures. Gel fragments were excised and eluted from the gel by using the Qiagen 96-well Gel Elution kit, following the manufacturer's instructions.

PCR products were ligated into pGEM-T Easy (Promega, Madison, Wis.) in a 96 well plate according to the following specifications: 60-80 ng of DNA, 5 μl 2× rapid ligation buffer, 0.5 μl pGEM-T Easy vector, 0.1 μl DNA ligase, filled to 10 μl with water, and incubated overnight.

Each clone was transformed into E. coli following standard procedures and DNA was extracted from 12 picked clones following standard protocols. DNA extraction and DNA quality were verified on a 1% agarose gel. The presence of the correct size insert in each of the clones was determined by restriction digests, using the restriction endonuclease EcoRI, and gel electrophoresis, following standard laboratory procedures.
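The insert-size check by EcoRI digestion can be mimicked in silico. The sketch below predicts fragment sizes for a circular plasmid cut at every EcoRI site (GAATTC, cut after the G). It is a simplified illustration: the function names are hypothetical, only one strand is scanned (sufficient for a palindromic site), and a site spanning the arbitrary start/end junction of the sequence string is not detected.

```python
from typing import List

ECORI_SITE = "GAATTC"  # EcoRI recognition site; cleavage is G^AATTC


def find_sites(seq: str, site: str = ECORI_SITE) -> List[int]:
    """0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def circular_fragment_sizes(
    plasmid: str, site: str = ECORI_SITE, cut_offset: int = 1
) -> List[int]:
    """Fragment sizes (bp, largest first) after complete digestion.

    Returns an empty list when the plasmid has no site and stays circular.
    Limitation: sites spanning the end/start junction are not found.
    """
    n = len(plasmid)
    cuts = sorted((p + cut_offset) % n for p in find_sites(plasmid, site))
    if not cuts:
        return []
    # Distances between consecutive cuts, plus the fragment that wraps around.
    sizes = [cuts[i + 1] - cuts[i] for i in range(len(cuts) - 1)]
    sizes.append(n - cuts[-1] + cuts[0])
    return sorted(sizes, reverse=True)
```

With two flanking sites 100 bp apart on a 300 bp toy plasmid, the function returns `[200, 100]`, the pattern expected when an insert of the correct size is released from the vector backbone.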

Example 2 Construction of Pine 4CL Expression Vectors

A series of recombinant constructs comprising at least a portion of a 4CL gene from loblolly pine were prepared and evaluated for their ability to reduce the lignin content in plants. In general, each DNA construct comprises a promoter operably linked to a first DNA segment that corresponds to at least a portion of a 4CL gene, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct. Eleven constructs were designed and prepared using different fragments of the 4CL gene from Pinus radiata (FIG. 5) and different promoters. The general designs for the constructs are depicted in FIGS. 7 to 9. The superubiquitin promoter (U.S. Pat. No. 6,380,459, Ranjan J Perera et al., Plant & Animal Genome VIII Conference (2000)) was used as a constitutive promoter, while a 4CL promoter from P. taeda (U.S. Pat. No. 6,252,135) was used as a vascular-preferred promoter. An intron from the YABBY gene (SEQ ID NO:64) from Arabidopsis thaliana (Foster T M et al., Plant Cell, 14 (7): 1497-1508 (2002)) was used as a spacer DNA segment. FIGS. 6A-6B provide the nucleic acid sequences of the 4CL RNAi fragments (A to H) (SEQ ID NOS: 18-24) utilized in the constructs.

A backbone vector was prepared by adding additional restriction endonuclease sites to the multiple cloning site of the plasmid pBluescript (BRL Gibco Life Technologies, Gaithersburg Md.). The NotI and SstI sites in the original pBluescript vector were destroyed by digestion of the plasmid with NotI and SstI and filling in the ends using Klenow and T4 Polymerase (Invitrogen Corp., Carlsbad Calif.). The plasmid was circularized by blunt-end ligation and then digested with the restriction endonucleases EcoRI and HindIII to enable cloning of linkers. Linkers (phosphorylated at the 5′ end) containing additional restriction sites (given in SEQ ID NOS: 1 and 2) were annealed together and ligated into the EcoRI/HindIII-digested pBluescript vector.

The 3′ UTR from the P. radiata superubiquitin gene (U.S. Pat. No. 6,380,459) was cloned into the plasmid pBI-121 (Jefferson et al., EMBO J. 6:3901-3907, 1987). First, a fragment of the 3′ UTR of the gene was amplified using standard PCR techniques and the primers given in SEQ ID NOS: 3 and 4. These primers contained additional nucleotides to provide an SstI restriction site for cloning into SstI-digested plasmid pBI-121. Then, the 3′ UTR fragment, containing the nos terminator, was transferred to the pBluescript plasmid. The 3′ UTR and nos terminator fragment of pBI-121 was amplified with PCR using primers given in SEQ ID NOS: 5 and 6, cleaved with KpnI and ClaI and cloned into the modified pBluescript digested with KpnI and ClaI.

To this construct, the P. radiata superubiquitin promoter sequence with intron was added. The promoter/intron sequence was first amplified from the P. radiata superubiquitin sequence identified in U.S. Pat. No. 6,380,459 using standard PCR techniques and the primers of SEQ ID NOS: 7 and 8. The amplified fragment was then ligated into the base vector using XbaI and PstI restriction digestion.

The P. radiata 4CL intron sequence (SEQ ID NO: 9) from the P. radiata cDNA was amplified using standard PCR techniques and the primers of SEQ ID NOS: 10 and 11, then cloned into XcmI-digested vector backbone using T-tailed ligation.

To isolate and characterize monolignol synthesis, monolignol transport, and lignin polymerization genes, as well as monolignol synthesis-like, monolignol transport-like, and lignin polymerization-like genes, from E. grandis and P. radiata, total RNA was extracted from plant tissue (using the protocol of Chang et al., Plant Mol. Biol. Rep. 11:113-116 (1993)). Plant tissue samples were obtained from phloem (P), cambium (C), expanding xylem (X1), and differentiating and lignifying xylem (X2).

mRNA was isolated from the total RNA preparation using either a Poly(A) Quik mRNA Isolation Kit (Stratagene, La Jolla, Calif.) or Dynal Beads Oligo (dT)25 (Dynal, Skogen, Norway). cDNA expression libraries were constructed from the purified mRNA by reverse transcriptase synthesis followed by insertion of the resulting cDNA clones in Lambda ZAP using a ZAP Express cDNA Synthesis Kit (Stratagene), according to the manufacturer's protocol. The resulting cDNAs were packaged using a Gigapack II Packaging Extract (Stratagene) using an aliquot (1-5 μL) from the 5 μL ligation reaction dependent upon the library. Mass excision of the library was done using XL1-Blue MRF′ cells and XLOLR cells (Stratagene) with ExAssist helper phage (Stratagene). The excised phagemids were diluted with NZY broth (Gibco BRL, Gaithersburg, Md.) and plated out onto LB-kanamycin agar plates containing X-gal and isopropylthio-beta-galactoside (IPTG).

Of the colonies plated and selected for DNA miniprep, 99% contained an insert suitable for sequencing. Positive colonies were cultured in NZY broth with kanamycin and cDNA was purified by means of alkaline lysis and polyethylene glycol (PEG) precipitation. Agarose gel at 1% was used to screen sequencing templates for chromosomal contamination. Dye primer sequences were prepared using a Turbo Catalyst 800 machine (Perkin Elmer/Applied Biosystems Division, Foster City, Calif.) according to the manufacturer's protocol.

DNA sequence for positive clones was obtained using a Perkin Elmer/Applied Biosystems Division Prism 377 sequencer. cDNA clones were sequenced first from the 5′ end and, in some cases, also from the 3′ end. For some clones, internal sequence was obtained using either Exonuclease III deletion analysis, yielding a library of differentially sized subclones in pBK-CMV, or by direct sequencing using gene-specific primers designed to identified regions of the gene of interest.

Using the methods described in Example 1, a Pinus radiata cDNA expression library was constructed from xylem and screened. DNA sequences for positive clones were obtained using forward and reverse primers on a Perkin Elmer/Applied Biosystems Prism 377 sequencer and the determined sequences were compared to known sequences in the EMBL database as described above. Based on similarity to known sequences from other plant species, the isolated DNA sequences were identified as encoding 4CL (SEQ ID NOS: 18-24) and caffeoyl CoA methyl transferase (SEQ ID NO:44).

A fragment from a P. radiata 4CL cDNA clone was amplified using standard PCR techniques and primers SEQ ID NOS: 12 and 13. The primers were designed to add PstI and ClaI restriction sites to both ends of the amplified fragments. The nucleotide sequence of the amplified fragment is provided as SEQ ID NO: 24. To clone the P. radiata 4CL fragment in the sense orientation, the amplified fragment was cut with the restriction enzyme PstI, blunt ended using Klenow and cloned into the backbone vector in a blunt-ended ClaI site. To clone the P. radiata 4CL fragment in the antisense orientation, the amplified fragment was digested with PstI and cloned into the PstI-digested backbone vector.

The yabby intron sequence (Foster et al. 2002, Plant Cell. 14 (7): 1497-1508) was amplified using primers similarly designed to those above for the Pr4CL and PDK intron sequences and cloned into the vector backbone as described above. Six additional fragments (SEQ ID NOS: 18-23) were amplified with primers similarly designed to those used for SEQ ID NO: 24, except that the primers for SEQ ID NO: 18 were designed to add SmaI restriction sites to both ends of the amplified fragment, the primers for SEQ ID NO: 19 were designed to add EcoRI and HindIII restriction sites at both ends of the amplified fragment, and the primers for SEQ ID NO: 22 were designed to add PstI restriction sites at both ends of the amplified fragment. The primers for SEQ ID NO: 23 were designed to add a SmaI restriction site to one end and EcoRI and HindIII restriction sites to the other end of the amplified fragment. All seven fragments were cloned in the sense and antisense directions into the backbone vector as described above or by using the listed restriction enzymes. The complete RNAi cassette, containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct, was removed from the pBluescript plasmid as described above and cloned into the binary vector pART27 or pART29 (digested with NotI) using standard cloning techniques. The binary vector pART29 is a modified pART27 vector (Gleave, Plant Mol. Biol. 20:1203-1207, 1992) that contains the Arabidopsis thaliana ubiquitin 3 (UBQ3) promoter instead of the nos5′ promoter and no lacZ sequences.

The complete RNAi cassette (SEQ ID NO: 14) containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct, was removed from the pBluescript plasmid by a NotI restriction digestion, and cloned into the binary vector pART29 (digested with NotI) using standard cloning techniques to produce the final vector pARB513.

The constructs were re-engineered for use in pine by removing the NotI fragments and inserting these into a base vector that had a NotI site as well as a constitutive promoter expressing GUS, to allow verification of transformation without PCR, and a selectable marker cassette comprising nptII driven by the Arabidopsis Ubq10 promoter. The promoter::4CL RNAi cassette was removed from each of the vectors listed in Table 1 in the “Engineered from” column using the restriction enzyme NotI. The vector pWVR31 was linearized using the restriction enzyme NotI and treated with SAP to prevent it from reannealing to itself. Each fragment was ligated into pWVR31 at the NotI site to produce the vectors listed in Table 1.

TABLE 1
Re-engineered construct number    Engineered from
pWVC60                            pARB318
pWVC62                            pARB319
pWVK158                           pARB320
pWVK154                           pARB321
pWVK157                           pARB322
pWVK155                           pARB323
pWVK143                           pARB332
pWVC42                            pARB333
pWVC46                            pARB334
pWVC40                            pARB335
pWVC43                            pARB336
pWVC44                            pARB337
pWVC45                            pARB338

Constructs pWVK154, pWVK143, pWVC46 and pWVC40 were deposited with the American Type Culture Collection, P.O. Box 1549, Manassas, Va., USA, 20108 on Sep. 21, 2004, and accorded ATCC Accession Nos. PTA-6229, PTA-6228, PTA-6227, and PTA-6226, respectively.

The control vectors pWVC41 and pWVK159 were developed by cloning the 4CL promoter from P. taeda (U.S. Pat. No. 6,252,135) and the superubiquitin gene from P. radiata (U.S. Pat. No. 6,380,459), respectively, together with the GUS (intron) gene (reference) into the vector pWVR31. The backbone vector pWVR5 is a pBI121 vector (Clontech Laboratories, Palo Alto Calif.) with the 35S promoter GUS sequence removed and the NOS promoter replaced with the UBQ10 promoter from Arabidopsis (Sun, C. W & Callis, J (1997) Plant J., 11:101-111). To make the vector pWVR8, the ActinII promoter (Meagher, Int. Rev. Cytol., 125:139-163 (1991)) was amplified and cloned into the pWVR5 vector together with the GUS plus intron gene (Ohta et al., Plant Cell Physiol, 31:805-813 (1990)).

The backbone vector pWVR31 was engineered from the vector pWVR8 (Arabidopsis ActinII::GUSINT, UBQ10::NPTII). The UBQ11 promoter from Arabidopsis (Norris S R, et al. (1993) Plant Mol Biol. 21(5):895-906) was amplified by PCR using primers, and this was used to replace the ActinII promoter from pWVR8 to make the vector pWVR31.

In addition, the vectors listed in Table 2 were constructed as described above but with modifications in at least one of the following: the promoter and/or the binary vector. To clone a different promoter as listed in Table 2 into the final vector, the P. radiata superubiquitin promoter intron vector was digested with SmaI and SstI restriction enzymes, and this fragment was cloned using standard techniques into Bluescript vectors containing either a 4CL promoter from P. taeda, a COMT promoter from Eucalyptus grandis, or a LIM promoter from P. radiata. The P. taeda 4CL promoter (U.S. Pat. No. 6,252,135), the E. grandis COMT promoter (U.S. patent application Ser. No. 10/703,091), and the P. radiata LIM promoter (U.S. patent application Ser. No. 10/717,897) were all amplified using primers similarly designed to those used to amplify the P. radiata superubiquitin promoter sequence with intron described above and then ligated into the base Bluescript vector as described above. The complete RNAi cassette, containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct, was removed from the pBluescript plasmid by a NotI restriction digestion and cloned into the binary vector pART29 or pWVK147 (digested with NotI) using standard cloning techniques. The pWVK147 vector is a pBI121 vector (Clontech Laboratories, Palo Alto Calif.) with the 35S promoter GUS sequence removed and the NOS promoter replaced with the UBQ10 promoter from Arabidopsis (Sun, C. W & Callis, J (1997) Plant J, 11:101-111) to drive the nptII gene. A unique HpaI restriction site was added to the vector by the addition of an adapter ligated at the ApaI and KpnI sites.

TABLE 2
Final vector  Base binary vector into which  Promoter driving the 4CL RNAi cassette containing
              final cassette was inserted    the P. radiata 4CL intron as spacer
pARB553       pWVK147                        Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76)
pARB555       pWVK147                        Pinus taeda 4CL (SEQ ID NO: 77)
pARB561       pWVK147                        Eucalyptus grandis COMT 485 bp fragment of U.S. Patent Publication No. 20040146904
pARB562       pWVK147                        Pinus radiata LIM 1607 bp fragment of U.S. Patent publication No. 20040163146
pARB515       pART29                         Pinus taeda 4CL (SEQ ID NO: 77)
pARB534       pART29                         Pinus radiata LIM 1607 bp fragment of U.S. Patent publication No. 20040163146

The vectors listed in Table 3 were constructed using the same methods as those described above, except that the primers SEQ ID NOS: 16 and 17 were used to amplify the PDK intron sequence (Wesley et al., Plant J. 27:581-590, 2001) (SEQ ID NO: 15) using standard PCR techniques.

TABLE 3
Final vector  Base binary vector into which  Promoter driving the 4CL RNAi cassette containing
              final cassette was inserted    the PDK intron as spacer
pARB554       pWVK147                        Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76)
pARB556       pWVK147                        Pinus taeda 4CL (SEQ ID NO: 77)
pARB557       pWVK147                        Eucalyptus grandis COMT 485 bp fragment of U.S. Patent Publication No. 20040146904
pARB558       pWVK147                        Pinus radiata LIM 1607 bp fragment of U.S. Patent publication No. 20040163146
pARB514       pART29                         Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76)
pARB516       pART29                         Pinus taeda 4CL (SEQ ID NO: 77)
pARB518       pART29                         Pinus radiata LIM 1607 bp fragment of U.S. Patent publication No. 20040163146

Example 3 Construction of Eucalyptus 4CL Expression Vectors

A series of recombinant constructs comprising at least a portion of a 4CL gene were prepared as described above and evaluated for their ability to reduce the lignin content in plants. In general, each DNA construct comprises a promoter operably linked to a first DNA segment (SEQ ID NO: 21) that corresponds to at least a portion of a 4CL gene from Eucalyptus grandis (U.S. Pat. No. 6,410,718), a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct. Initially, three constructs were prepared using different fragment lengths of the 4CL gene and different promoters. See Table 11. The general design for the constructs is depicted in FIG. 7. The superubiquitin promoter (U.S. Pat. No. 6,380,459; Ranjan J Perera et al., Plant & Animal Genome VIII Conference (2000)) was used as a constitutive promoter, while the promoter from the 4CL gene in P. taeda (SEQ ID NO: 77) was used as a vascular-preferred promoter. An intron from the YABBY gene from Arabidopsis thaliana (Foster T M et al., Plant Cell, 14 (7): 1497-1508 (2002)) was used as a spacer DNA segment. FIGS. 2A & 2B provide the nucleic acid sequences of the 4CL RNAi 200 bp fragment (SEQ ID NO:33) and 4CL RNAi 600 bp fragment (SEQ ID NO:34).

The construction of the backbone vector was as described in Example 2. A fragment from E. grandis 4CL cDNA clone (U.S. Pat. No. 6,410,718) was amplified using standard PCR techniques and primers given in SEQ ID NOS: 25 and 26. The primers were designed to add PstI and ClaI restriction sites to both ends of the amplified fragments. The nucleotide sequence of the amplified fragment is given in SEQ ID NO: 27. To clone the 4CL fragment in the sense orientation, the amplified fragment was cut with the restriction enzyme PstI, and cloned into the backbone vector. To clone the 4CL fragment in the antisense orientation, the amplified fragment was digested with ClaI and cloned into the backbone vector.

The complete RNAi cassette (SEQ ID NO: 32), containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct, was removed from the pBluescript plasmid by a NotI restriction digestion and cloned into the binary vector pART29 (digested with NotI) as described in Example 2 to produce the final vector pARB583.

The final vectors listed in Table 4 were constructed by amplifying four additional fragments (SEQ ID NOS: 28-31) with primers similarly designed to those used for the fragment in the example above. All five fragments were cloned in the sense and antisense directions into the backbone vector as described above before the complete RNAi cassettes were cloned into pART29 as described above.

TABLE 4
Final vector  Fragment cloned in forward and  Intron used as spacer
              reverse orientation for RNAi
pARB584       SEQ ID NO: 28                   SEQ ID NO: 9
pARB585       SEQ ID NO: 29                   SEQ ID NO: 9
pARB586       SEQ ID NO: 30                   SEQ ID NO: 9
pARB587       SEQ ID NO: 31                   SEQ ID NO: 9

The vectors listed in Table 5 were constructed using the same methods as those described above, except that the primers SEQ ID NOS: 16 and 17 were used to amplify the PDK intron sequence (Wesley et al., Plant J. 27:581-590, 2001) (SEQ ID NO: 15) using standard PCR techniques.

TABLE 5
Final vector  Fragment cloned in forward and  Intron used as spacer
              reverse orientation for RNAi
pARB578       SEQ ID NO: 27                   SEQ ID NO: 15
pARB579       SEQ ID NO: 28                   SEQ ID NO: 15
pARB580       SEQ ID NO: 29                   SEQ ID NO: 15
pARB581       SEQ ID NO: 30                   SEQ ID NO: 15
pARB582       SEQ ID NO: 31                   SEQ ID NO: 15

The vectors listed in Table 6 were constructed as described in Example 2 with the following changes. The yabby intron sequence (Foster et al. 2002, Plant Cell. 14 (7): 1497-1508) was amplified using primers similarly designed to those for the Pr4CL and PDK intron sequences and cloned into the vector backbone as described in Example 2. The fragment inserts SEQ ID NOS: 33 and 34 were amplified with primers similarly designed to those used for the fragments SEQ ID NOS: 27-31 in the example above. Substitutions of the promoter from the Pinus radiata Superubiquitin promoter plus intron for the P. taeda 4CL promoter were done as described in Example 2 where so designated in Table 6 below. The listed fragment insert and promoter were cloned into the final vector as described above in Example 2 before the complete RNAi cassettes were cloned into pART27.

TABLE 6
Final vector  Promoter driving RNAi cassette                    Fragment cloned in forward and reverse orientation
                                                                around yabby intron spacer for RNAi
pARB339       Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76)  SEQ ID NO: 33
pARB341       Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76)  SEQ ID NO: 34
pARB345       Pinus taeda 4CL (SEQ ID NO: 77)                   SEQ ID NO: 33
pARB347       Pinus taeda 4CL (SEQ ID NO: 77)                   SEQ ID NO: 34

The final vectors listed in Table 7 were constructed by removing the complete RNAi cassette containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct from the pARB345 final vector listed above by a NotI restriction digestion, and cloning it into either the binary vector pARB1002 or pARB1005 (digested with NotI) using standard cloning techniques.

TABLE 7
Final vector  Base binary vector into which RNAi cassette was inserted
pARB599       pARB1002 (SEQ ID NO: 61)
pARB639       pARB1005 (SEQ ID NO: 63)

To modulate the lignin content in Eucalyptus plants, constructs comprising various combinations of promoters, first DNA segments and introns can be used. With a selection of constructs from which to choose, a practitioner can obtain plants with the desired amounts of lignin content and growth. Table 8 provides a variety of constructs useful in this regard.

TABLE 8

Promoter | Fragment | Intron
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | PDK
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 223 bp fragment (201-423) (SEQ ID NO: 28) | PDK
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | PDK
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 336 bp fragment (1031-1378) (SEQ ID NO: 30) | PDK
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | PDK
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | PDK
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 223 bp fragment (201-423) (SEQ ID NO: 28) | PDK
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | PDK
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 336 bp fragment (1031-1378) (SEQ ID NO: 30) | PDK
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | PDK
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | PDK
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 223 bp fragment (201-423) (SEQ ID NO: 28) | PDK
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | PDK
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 336 bp fragment (1031-1378) (SEQ ID NO: 30) | PDK
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | PDK
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | PDK
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 223 bp fragment (201-423) (SEQ ID NO: 28) | PDK
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | PDK
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 336 bp fragment (1031-1378) (SEQ ID NO: 30) | PDK
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | PDK
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | Pr4CL
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | Pr4CL
Eucalyptus grandis COMT 485 bp (U.S. Patent Publication No. 20040146904) | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | Pr4CL
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | Pr4CL
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | Pr4CL
Eucalyptus grandis COMT 306 bp of U.S. Patent Publication No. 20040146904 | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | Pr4CL
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | Pr4CL
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | Pr4CL
Pinus radiata LIM 1607 bp of U.S. Patent Publication No. 20040163146 | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 29) | Pr4CL
Euc LIM of U.S. Patent Publication No. 20040163146 | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | Pr4CL
Euc LIM of U.S. Patent Publication No. 20040163146 | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | Pr4CL
Euc LIM of U.S. Patent Publication No. 20040163146 | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | Pr4CL
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 200 bp fragment (1-200) (SEQ ID NO: 27) | Pr4CL
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 300 bp fragment (551-850) (SEQ ID NO: 29) | Pr4CL
P. taeda 4CL (SEQ ID NO: 77) | Euc 4CL 500 bp fragment (1521-2020) (SEQ ID NO: 31) | Pr4CL
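Each fragment in Table 8 is named by a nominal length and a coordinate range. The following minimal sketch compares the two, assuming the coordinates are 1-based and inclusive (an assumption; the table does not state the convention). The function names are hypothetical. Under that assumption four of the five spans equal the nominal length, while the range listed for the 336 bp fragment spans 348 positions; the sketch only reports the difference and does not decide which figure is intended.

```python
# Compare nominal fragment lengths with the coordinate ranges listed in Table 8.
# Assumption: coordinates are 1-based and inclusive.

FRAGMENTS = [
    # (sequence identifier, nominal length in bp, start, end)
    ("SEQ ID NO: 27", 200, 1, 200),
    ("SEQ ID NO: 28", 223, 201, 423),
    ("SEQ ID NO: 29", 300, 551, 850),
    ("SEQ ID NO: 30", 336, 1031, 1378),
    ("SEQ ID NO: 31", 500, 1521, 2020),
]


def span_length(start: int, end: int) -> int:
    """Number of positions covered by an inclusive coordinate range."""
    return end - start + 1


def length_report(fragments):
    """Return (identifier, nominal, span, span - nominal) for each fragment."""
    return [
        (name, nominal, span_length(start, end), span_length(start, end) - nominal)
        for name, nominal, start, end in fragments
    ]


if __name__ == "__main__":
    for name, nominal, span, diff in length_report(FRAGMENTS):
        print(f"{name}: nominal {nominal} bp, span {span} bp, difference {diff}")
```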

Example 4 Isolation of cDNAs of E. grandis CCoAOMT, C3H, C4H and CCR

Two Eucalyptus grandis cDNA expression libraries (one from a mixture of various tissues from a single tree and one from leaves of a single tree) were constructed and screened as follows.

mRNA was extracted from the plant tissue using the protocol of Chang et al. (Plant Molecular Biology Reporter 11:113-116, 1993) with minor modifications. Specifically, samples were dissolved in CPC-RNAXB (100 mM Tris-Cl, pH 8.0; 25 mM EDTA; 2.0 M NaCl; 2% CTAB; 2% PVP and 0.05% spermidine·3HCl) and extracted with chloroform:isoamyl alcohol, 24:1. mRNA was precipitated with ethanol and the total RNA preparation was purified using a Poly(A) Quik mRNA Isolation Kit (Stratagene, La Jolla, Calif.). A cDNA expression library was constructed from the purified mRNA by reverse transcriptase synthesis followed by insertion of the resulting cDNA clones in Lambda ZAP using a ZAP Express cDNA Synthesis Kit (Stratagene), according to the manufacturer's protocol. The resulting cDNAs were packaged using a Gigapack II Packaging Extract (Stratagene) employing 1 μl of sample DNA from the 5 μl ligation mix. Mass excision of the library was done using XL1-Blue MRF′ cells and XLOLR cells (Stratagene) with ExAssist helper phage (Stratagene). The excised phagemids were diluted with NZY broth (Gibco BRL, Gaithersburg, Md.) and plated out onto LB-kanamycin agar plates containing X-gal and isopropylthio-beta-galactoside (IPTG).

Of the colonies plated and picked for DNA miniprep, 99% contained an insert suitable for sequencing. Positive colonies were cultured in NZY broth with kanamycin and cDNA was purified by means of alkaline lysis and polyethylene glycol (PEG) precipitation. Agarose gel at 1% was used to screen sequencing templates for chromosomal contamination. Dye primer sequences were prepared using a Turbo Catalyst 800 machine (Perkin Elmer/Applied Biosystems, Foster City, Calif.) according to the manufacturer's protocol.

DNA sequences for positive clones were obtained using a Perkin Elmer/Applied Biosystems Prism 377 sequencer. cDNA clones were sequenced first from the 5′ end and, in some cases, also from the 3′ end. For some clones, internal sequence was obtained using subcloned fragments. Subcloning was performed using standard procedures of restriction mapping and subcloning to pBluescript II SK+ vector.

The determined cDNA sequences were compared to known sequences in the EMBL database (release 46, March 1996) using the FASTA algorithm of February 1996 (Version 2.0.4) or the BLAST algorithm Version 2.0.4 [Feb. 24, 1998], or Version 2.0.6 [Sep. 16, 1998]. Multiple alignments of redundant sequences were used to build up reliable consensus sequences. Based on similarity to known sequences from other plant species, the isolated polynucleotides of the present invention were identified as encoding a specified enzyme.
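The step of building a consensus from multiple alignments of redundant sequences can be sketched as a column-wise majority vote. This is an illustrative simplification, not the procedure or software actually used: it assumes the reads have already been aligned to equal length with "-" marking gaps, and the function name and toy reads are hypothetical.

```python
# Majority-vote consensus over pre-aligned, equal-length reads.
# Assumption: alignment has been done beforehand; "-" marks a gap.
from collections import Counter


def consensus(aligned_reads, gap="-"):
    """Return the most common non-gap base in each alignment column."""
    if not aligned_reads:
        raise ValueError("at least one read is required")
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    bases = []
    for column in zip(*aligned_reads):
        counts = Counter(base for base in column if base != gap)
        if counts:  # skip columns that contain only gaps
            bases.append(counts.most_common(1)[0][0])
    return "".join(bases)


if __name__ == "__main__":
    # Toy reads, for illustration only.
    print(consensus(["ATGCA", "ATGTA", "AT-CA"]))
```

A column with a tie is resolved in favor of the base seen first, which is an arbitrary choice; a production pipeline would normally weight by base quality instead.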

Using the procedures described above, cDNA sequences derived from the Eucalyptus grandis library encoding the following polypeptides were isolated: caffeoyl CoA methyl transferase (U.S. Pat. No. 6,410,718); cinnamate-4-hydroxylase (C4H) (U.S. Pat. No. 6,410,718); p-coumarate-3-hydroxylase (C3H) (U.S. Pat. No. 5,981,837) and CCR (U.S. Pat. No. 6,410,718).

Example 5 Construction of Pinus radiata LIM Expression Vectors

The final vectors listed in Table 9 were constructed as described in Example 2 with the following modifications: the use of different fragments, promoters and/or introns. Two fragments (SEQ ID NOS: 38 & 39) from the P. radiata LIM cDNA clone (patent application WO 00/53724) were amplified using standard PCR techniques and primers designed similarly to those used in Example 2. The P. radiata LIM fragments were cloned into the backbone vector in both the sense and antisense orientations as described in Example 2. Final vectors in Table 9 containing a promoter different from that contained in the backbone vector were constructed by changing the promoter in a manner similar to that described in Example 2. The yabby intron was inserted into the final vectors using the method described in Example 2. The complete RNAi cassettes were cloned into pART27 or pART29 as described in Examples 1 and 2.

TABLE 9

Final Vector | Binary vector into which the RNAi cassette was inserted | Promoter driving the RNAi cassette | Fragment cloned in forward and reverse orientation in RNAi cassette
pARB348 | pART27 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 38
pARB352 | pART27 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 38
pARB349 | pART27 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 39
pARB353 | pART27 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 39
pARB235 | pART29 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 38
pARB236 | pART29 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 39
pARB243 | pART29 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 38
pARB244 | pART29 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 39

To utilize vectors based on pART27 in pine, the constructs must be re-engineered to remove the selection cassette nos::nptII. As described in Example 2, NotI fragments can be removed and inserted into a base vector that has a NotI site as well as a constitutive promoter expressing GUS, to allow verification of transformation without PCR, and a selectable marker cassette comprising nptII driven by the Arabidopsis Ubq10 promoter. The vector pWVR31 can be used as a new base vector.

Example 6 Construction of Eucalyptus grandis LIM Expression Vectors

The construction of the backbone plasmid was as described in Example 2. Two fragments (SEQ ID NOS: 40 & 41) from E. grandis LIM cDNA clone (patent application WO00/53724) were amplified using standard PCR techniques and primers designed to add EcoRI and XbaI restriction sites to both ends of the amplified fragments. To clone the LIM fragments in the sense orientation, the amplified fragments were cut with the restriction enzymes EcoRI and XbaI, blunt ended using Klenow and cloned into the backbone vector containing the yabby intron and P. radiata superubiquitin promoter sequence (described in Example 2) in a blunt-ended ClaI site. To clone the LIM fragments in the antisense orientation, the amplified fragments were cut with the restriction enzymes EcoRI and XbaI, blunt ended using Klenow and cloned into the same backbone vector in a blunt-ended PstI site using standard cloning techniques.
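The idea of primers that add restriction sites to the ends of an amplified fragment can be sketched as follows. The recognition sequences for EcoRI (GAATTC) and XbaI (TCTAGA) are standard; everything else is an assumption made for illustration: the sketch places one site on each primer, and the 5′ clamp, annealing length, function names and toy template are hypothetical rather than the primers actually used.

```python
# Sketch: design a primer pair whose 5' tails carry restriction sites.
# Assumptions: EcoRI site on the forward primer, XbaI site on the reverse
# primer, a short 5' clamp, and a fixed annealing length.

ECORI_SITE = "GAATTC"
XBAI_SITE = "TCTAGA"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def tailed_primers(template: str, anneal_len: int = 20, clamp: str = "GC"):
    """Return (forward, reverse) primers, each written 5'->3'."""
    if len(template) < anneal_len:
        raise ValueError("template is shorter than the annealing length")
    forward = clamp + ECORI_SITE + template[:anneal_len]
    reverse = clamp + XBAI_SITE + reverse_complement(template[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    # Toy template, for illustration only.
    fwd, rev = tailed_primers("ATGCCCCCGGTA", anneal_len=4)
    print(fwd, rev)
```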

The complete RNAi cassette containing the promoter::sense fragment::intron::antisense fragment::3′UTR::nos terminator construct, was removed from the backbone vector by a NotI restriction digestion, and cloned into the binary vector pART29 (digested with NotI) using standard cloning techniques. For final vectors containing a different promoter as listed in Table 10, the promoter sequence was substituted using the method described in Example 2. The vectors listed in Table 10 were constructed using this method.

TABLE 10

Final Vector | Promoter driving the RNAi cassette | Fragment cloned in forward and reverse orientation in RNAi cassette
pARB489 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 40
pARB490 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 41
pARB491 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 40
pARB492 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 41

Example 7 Construction of Pine CCoAOMT Expression Vector

The following vector was cloned as described in Example 2, with the modification that a fragment from the pine CCoAOMT (caffeoyl-coenzyme A O-methyltransferase) clone (SEQ ID NO: 42) was amplified with primers designed similarly to those used in Example 2 and used in a method in accordance with that described in Example 2. The final vector was also modified by the addition of the yabby intron and the use of the pART27 binary vector using the methods described in Example 2.

TABLE 11

Final Vector | Promoter | Fragment
pARB357 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 42

To utilize the vector in pine, the construct must be re-engineered to remove the selection cassette nos::nptII. As described in Example 2, NotI fragments can be removed and inserted into a base vector that has a NotI site as well as a constitutive promoter expressing GUS, to allow verification of transformation without PCR, and a selectable marker cassette comprising nptII driven by the Arabidopsis Ubq10 promoter. The vector pWVR31 can be used as a new base vector.

Example 8 Construction of Additional Pine CCoAOMT Expression Vectors

The following vectors were cloned as described in Example 3, with the modifications that a fragment from the pine CCoAOMT (caffeoyl-coenzyme A O-methyltransferase) clone (SEQ ID NO: 43) (isolated in Example 4) was amplified with primers designed similarly to those used in Example 4 and used in a method in accordance with that described in Example 4. The final vectors were also modified by the addition of the PDK intron, the use of either the P. radiata Superubiquitin promoter with intron or the P. taeda 4CL promoter, and the use of the pWVK147 binary vector, using the methods described above.

TABLE 12

Final Vector | Promoter | Fragment
pARB559 | Pinus radiata SuperUbiq + Intron (SEQ ID NO: 76) | SEQ ID NO: 43
pARB560 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 43

Example 9 Construction of E. grandis CCoAOMT Expression Vectors

The following vectors were cloned as described in Example 3, with the modifications that a fragment from the E. grandis CCoAOMT (caffeoyl-coenzyme A O-methyltransferase) clone (SEQ ID NO: 44) (isolated in Example 4; filed as partial sequence in WO98/11205) was amplified with primers designed similarly to those used in Example 3 and used in a method in accordance with that described in Example 3. The final vectors were also modified by the addition of the PDK intron or the Eucalyptus xylem intron, the E. grandis COMT promoter and the use of the pART29 binary vector using the methods described in Example 3.

TABLE 13

Final Vector | Fragment | Intron
pARB523 | SEQ ID NO: 44 | SEQ ID NO: 15
pARB524 | SEQ ID NO: 44 | Eucalyptus Xylem intron

Example 10 Construction of E. grandis CCR Expression Vectors

The following vectors were cloned as described in Example 3, with the modifications that a fragment from the E. grandis CCR (cinnamoyl CoA reductase) clone (SEQ ID NO: 45) (isolated in Example 4) was amplified with primers designed similarly to those used in Example 3 and used in a method in accordance with that described in Example 3. The final vectors were also modified by the addition of the PDK intron or the Eucalyptus xylem intron, the E. grandis COMT promoter 485 bp fragment of U.S. patent application No. 10/703,091 and the use of the pART29 binary vector using the methods described in Example 3.

TABLE 14

Final Vector | Fragment | Intron
pARB525 | SEQ ID NO: 45 | SEQ ID NO: 15
pARB526 | SEQ ID NO: 45 | Eucalyptus Xylem intron from patent WO00/22092

Example 11 Construction of E. grandis C3H and C4H Expression Vectors

The following vectors were cloned as described in Example 3, with the modifications that the fragments from the E. grandis C3H clones (SEQ ID NO: 46) (isolated in Example 4) or E. grandis C4H clones (SEQ ID NO: 47) (isolated in Example 4; filed as partial sequence in WO00/22099) were amplified with primers designed similarly to those used in Example 2 and used in a method in accordance with that described in Example 3. Either the Arabinogalactan promoter from E. grandis (SEQ ID NO: 35) or the 4CL promoter from P. taeda (U.S. Pat. No. 6,252,135) was used in these vectors. The P. radiata superubiquitin promoter intron vector was digested with the BamHI restriction enzyme and, using standard techniques, cloned into Bluescript vectors containing either a 4CL promoter from P. taeda (digested with BamHI), or the Arabinogalactan promoter from E. grandis (digested with ClaI). The P. taeda 4CL promoter and the E. grandis Arabinogalactan promoter were both amplified using primers designed similarly to those used to amplify the P. radiata superubiquitin promoter sequence with intron and then ligated into the base Bluescript vector as described in Example 3. The final vector was also modified by the addition of the Pr4CL intron, and the use of the pARB1002 binary vector, using the methods described in Example 3.

TABLE 15

Final Vector | Promoter | Fragment
pARB669 | Eucalyptus grandis Arabinogalactan 2446 bp (SEQ ID NO: 35) | SEQ ID NO: 46
pARB670 | Eucalyptus grandis Arabinogalactan 2446 bp (SEQ ID NO: 35) | SEQ ID NO: 47
pARB672 | Pinus taeda 4CL (SEQ ID NO: 77) | SEQ ID NO: 47

Example 12 Evaluation of 4CL Constructs in Eucalyptus

Three different constructs containing RNAi fragments of two different lengths, pARB339, pARB341 and pARB345 (see Table 16) were transformed into Eucalyptus grandis using the following procedure.

TABLE 16

DNA Construct Name | Construct description
pARB339 | constitutive promoter driving 4CL RNAi 200 bp fragment
pARB341 | constitutive promoter driving 4CL RNAi 600 bp fragment
pARB345 | vascular-preferred promoter driving 4CL RNAi 200 bp fragment

Clonal Eucalyptus grandis leaf explants micropropagated in culture on elongation media (MS with 1 μM BAP, 20 g/L sucrose and 7 g/L agar) were used for transformation. Transformation was carried out as described in Burrel et al., International Publication No. WO00/12715, which is hereby incorporated by reference.

Transgenic explants were selected as described in WO00/12715 except that NAA was omitted, and media contained 50 mg/L kanamycin and 250 mg/L timentin. Explants remained on this medium for two weeks, were then transferred to media containing 100 mg/L kanamycin and 250 mg/L timentin for a further two weeks, and were then transferred to media containing 150 mg/L kanamycin and 250 mg/L timentin. Cultures were then transferred on a monthly basis to fresh media containing 150 mg/L kanamycin and 250 mg/L timentin until healthy single shoots could be collected. Single shoots were placed onto elongation media to proliferate the putative transgenic tissue. When approximately 200 mg of tissue could be collected from the proliferating tissue, this was removed from the primary explant for PCR analysis. PCR analysis for the presence of both the promoter and the selection gene was carried out using PuRe Taq Ready-To-Go™ PCR beads (Amersham Biosciences), according to the manufacturer's instructions.

Tissues with positive PCR results were then proliferated further on elongation medium containing 150 mg/L kanamycin and 250 mg/L Timentin, and maintained as stock cultures.

To generate transgenic plants for further testing, some shoots were placed onto an elongation medium. Shoots were maintained on this medium until they were approximately 2-3 cm tall. If this took more than 1 month shoots were placed onto fresh medium at monthly intervals. Once shoots were 2-3 cm tall, single shoots were removed and placed into a rooting medium. After 10 days in rooting medium plants were transferred to the greenhouse. Those skilled in the art of plant transformation and plant tissue culture will recognize that many different culture media and intervals may be suited to regenerating plants of the instant invention.

Plants were grown in the greenhouse for six months in potting mixture, using an appropriate humidity regime and fungicides to control fungal growth. Plants were grown in a meshed compartment at ambient temperature with capillary watering. Plants were potted into 5 L poly-bags in a soil-less, peat-based compost supplemented with a slow-release fertilizer.

Plants at approximately six months of age were destructively sampled for total lignin analysis.

Height Measurements

Table 17 lists the percentage of micropropagated plants selected with the use of kanamycin that survived in soil after six months, the percentage of dwarfed plants observed at 20 weeks after being planted in soil and average height of plants at 22 weeks after being planted in soil of Eucalyptus plants transformed with pARB339, pARB341 or pARB345.

Survival of plants transformed with pARB341 was much lower than that of plants transformed with pARB339 or pARB345. Of the plants transformed with pARB341 that survived, 82% were dwarfed, suggesting that the DNA vector pARB341 affected the height and survival rate of the plants to a greater extent than the other two vectors (pARB339 and pARB345).

TABLE 17

Construct | % Survived after 6 months | % plants dwarfed at 20 weeks | Mean height of plants analyzed for lignin content at 22 weeks (cm)
pARB339 | 95 | 2.8 | 117
pARB341 | 38 | 82 | 13
pARB345 | 83 | 2.9 | 127

The data presented in FIGS. 3 and 4A demonstrate the apparent effect of each construct on plant height. While the tallest individual plants in each set of plants transformed with pARB345 and pARB339 are close (159 and 168 cm, respectively) the shortest pARB339 plants (53 cm, 64 cm) are much shorter than the shortest pARB345 plants (91 cm, 96 cm). This figure does not include the average height of the dwarf pARB341 samples that were pooled for analysis. The average height of the dwarf pARB341 plants was 13 cm.

Lignin Analysis

Transgenic Eucalyptus trees generated as described in the previous example were sampled for lignin analysis at approximately six months of age. The bottom 20 cm of the stem was collected from all the samples to be analyzed. The bark, phloem and primary cortex were removed from the stem by peeling, and the stem samples were then flash frozen in liquid nitrogen. Frozen samples were freeze-dried in a Flexi-Dry Microprocessor Control, corrosion-resistant freeze-drier (Stone Ridge, N.Y., USA) according to the manufacturer's instructions. Samples were ground in a Wiley Mill (Arthur H. Thomas Co., Philadelphia, USA) and then re-ground in a ring mill. Ground samples were then dried for a minimum of 1 day at 55° C. and stored at this temperature until used. Cell wall material was isolated from the samples in a series of stages by suspending the ground material in the solvent or solution, extracting with an ultrasonic cleaner, centrifuging and then decanting off the supernatant. The following sequence of extractions was used: NaCl at two concentrations; aqueous ethanol; CHCl₃:MeOH; and acetone. To remove the starch, the extracted cell wall materials were washed, heated in tris-acetate buffer to gelatinize the starch and then treated with α-amylase. Following enzyme treatment, the suspension was centrifuged and the resulting precipitate was washed with ethanol and acetone, allowed to stand overnight, and then dried at 55° C. The isolated cell wall material was used for small-scale lignin determinations carried out using the procedure described in Fukushima, R. S. and Hatfield, R. D. (2001) J. Ag. Food Chem. 49(7):3133-9. Results are shown in FIGS. 4A & 4B.

The RNAi cassette in pARB341 caused 82% of all transformed plants to be dwarfed. A pooled sample of these plants showed that they had reduced lignin levels, to approximately 80% of normal levels. This vector had the greatest effect on plant height when compared to the other two vectors tested and also a large effect on reducing lignin levels. While the extreme end of the lignin-reduction ranking features dwarf phenotypes, the lowest-lignin transline of all identified in this study, a pARB345 transline, has reasonably normal height. Hence the dwarfism seen in many of the pARB341 transformants may be a separate phenomenon caused by suppression of genes other than the 4CL gene expressed in lignifying secondary xylem, for example 4CL genes expressed in other parts of the plant or genes with partial homology to 4CL.

The RNAi cassette in pARB345 was found to be more effective than that in pARB339 at producing phenotypes with significantly reduced lignin. The 200 bp RNAi cassette in pARB345 is capable of inducing lignin reductions of up to 25% without also triggering the dwarfing effect induced in many transformants by the 600 bp RNAi cassette driven by the same promoter in pARB341.

Nine plants transformed with pARB345 were selected from the lignin analysis above, and a second 20 cm stem sample, harvested from above the first, was submitted for lignin content determination by pyrolysis molecular beam mass spectrometry and by solid-state ¹³C NMR for comparison of methods. All three methods gave approximately the same values for lignin reduction.

For pyrolysis molecular beam mass spectrometry, each sample was weighed in a quartz boat and pyrolyzed in a reactor consisting of a quartz tube (2.5 cm inside diameter) with helium flowing through at 5 L/min (at STP). The reactor tube was placed such that the sampling orifice of the molecular-beam mass spectrometer was inside the end of the quartz reactor. A molecular-beam mass spectrometer using an Extrel™ Model TQMS C50 mass spectrometer was used for pyrolysis vapor analysis as described in Evans & Milne (1987) (Energy & Fuels, 1: 123-37). The reactor was electrically heated and its temperature maintained at 550° C. Total pyrolysis time was 90 seconds, although the pyrolysis reaction was completed in less than 50 seconds. The residence time of the pyrolysis vapors in the reactor pyrolysis zone has been estimated to be ~75 ms and is short enough that secondary cracking reactions in the quartz reactor are minimal. Mass spectral data from 20-450 Da were acquired on a Teknivent Vector 2™ data acquisition system using 22 eV electron impact ionization. Using this system, both light gases and heavy tars were sampled simultaneously and in real time. The mass spectrum of the pyrolysis vapor provides a rapid, semiquantitative depiction of the molecular fragments.

Principal component analysis of the pyMBMS spectra using a mass range between m/z 50 and 200 highlighted pyrolysis products from lignin and carbohydrates while minimizing small pyrolysis and electron impact fragments (below m/z 50) and extractives (above m/z 200).

For NMR determination of lignin content, high-resolution, solid-state ¹³C NMR spectra were collected at 4.7 T with cross-polarization (CP) and magic angle spinning (MAS) in a Bruker Avance 200 MHz spectrometer. Variable amplitude cross-polarization (1 dB linear ramp over the cross-polarization period) was used to minimize variations of the nonprotonated aromatic carbons that are sensitive to Hartmann-Hahn mismatch at higher MAS rotation rates (S. O. Smith, I. Kustanovich, X. Wu, O. B. Peersen, Journal of Magnetic Resonance (1994) 104: 334-339). ¹H and ¹³C fields were matched at 53.6 kHz and a 1 dB ramp was applied to the proton r.f. during the matching period. Acquisition time was 0.033 seconds and sweep width was 31.3 kHz. Magic-angle spinning was performed at a rate of 7000 Hz. Between 2000 and 4000 scans were averaged using a 2 ms contact time and a pulse repetition rate of 1.0 sec. Differences observed in relative peak intensities and integrated areas can be used to identify differences between similar samples. Weight % lignin values were calculated from the integrated areas of the aromatic (110 ppm-160 ppm) and carbohydrate (40 ppm-100 ppm) regions using the method of Haw et al. 1984 (J. F. Haw, G. E. Maciel, H. A. Schroder, Analytical Chemistry 56: 1323).

Data analysis was performed using the Unscrambler version 7.8 software program (CAMO A/S, Trondheim, Norway). The Projection to Latent Structure (PLS-1) algorithm, which handles only one Y-variable at a time, was used to construct the model for predicting the lignin contents of the pine samples. The lignin content predictive model was developed using the pyMBMS spectra as the X-matrix (310 variables (m/z values between 50 and 360)) and the lignin values measured by solid-state NMR as the Y-matrix. The mass spectra were normalized to the total ion current before analysis. Model validation was performed using full cross validation, which systematically removes one sample from the data, establishes a model with the remaining samples and then uses that model to predict the value of the Y-variable of the sample that was removed from the data set. The process continues until every sample has been removed from the data set and predicted once. Goodness-of-fit (i.e., a high correlation coefficient) and minimal residual error were the criteria used for choosing the best model.

A PLS1 model to predict lignin content was constructed from the NMR lignin values and the pyMBMS spectra. In cases where more than one tree from the same line was sampled for the NMR analysis, the corresponding mass spectra from the trees were averaged and used to build the model. A PLS model was constructed using a range of m/z values from 50 to 360. This range was determined empirically to provide the best model based on the correlation coefficient of the fully cross-validated model. The final fully cross-validated model, shown in FIG. 8, had an RMSEP of 0.9 and an r² value of 0.94.
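The full (leave-one-out) cross-validation procedure described above can be sketched in plain Python. This is not the Unscrambler PLS-1 model: as a stated simplification, a one-variable least-squares line stands in for PLS, with the average PC1 scores from Table 18 as the single predictor and the NMR lignin values from Table 18 as the response. The function names are hypothetical, and the resulting error is not expected to reproduce the RMSEP reported for the full-spectrum model.

```python
# Leave-one-out ("full") cross-validation with a simple linear model.
# Assumption: a one-variable least-squares fit replaces PLS-1 for illustration.
from math import sqrt

# Values transcribed from Table 18 (average PC1 score, NMR lignin value).
PC1 = [2.8335, -3.4605, -0.568, -2.5165, -4.819, 2.395, -0.435, -1.43831, 1.4815]
NMR_LIGNIN = [14.1, 19.5, 17.0, 19.1, 20.1, 14.4, 15.7, 19.9, 14.9]


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def leave_one_out(xs, ys):
    """Predict each sample from a model fitted to all of the other samples."""
    predictions = []
    for i in range(len(xs)):
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        predictions.append(slope * xs[i] + intercept)
    return predictions


def rmsep(observed, predicted):
    """Root mean square error of prediction."""
    n = len(observed)
    return sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


if __name__ == "__main__":
    held_out = leave_one_out(PC1, NMR_LIGNIN)
    print(f"RMSEP (one-variable stand-in model): {rmsep(NMR_LIGNIN, held_out):.2f}")
```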

Table 18 shows a comparison of the NMR results for the nine selected samples. Comparison of the NMR wt % lignin values with the PC1 scores for the selected samples shows that the PC1 scores accurately reflect the amount of lignin in the loblolly pine samples and that the PC1 scores can be used to rank the lignin content of the different constructs. There is also excellent correlation between the NMR-determined lignin content and the content as determined by the acetyl bromide method described above.

TABLE 18

Eucalyptus grandis clone, construct and event number | Pyrolysis MBMS: average PC1 | Pyrolysis MBMS: PC1 deviation | Pyrolysis MBMS: average PC2 | Pyrolysis MBMS: PC2 deviation | NMR lignin values | Average lignin (%) determined by acetyl bromide method
824.019 pARB345-002-3 | 2.8335 | 0.287792 | −0.567 | 0.100409 | 14.1 | 15.83
824.019 pARB345-014-1 | −3.4605 | 1.069853 | −0.7475 | 0.245366 | 19.5 | 20.05
824.019 pARB345-015-2 | −0.568 | 1.52028 | 0.11718 | 0.115711 | 17 | 16.22
824.019 pARB345-026-1 | −2.5165 | 2.181424 | 0.5005 | 2.085258 | 19.1 | 20.6
824.019 pARB345-033-1 | −4.819 | 0.254558 | −1.0015 | 0.939745 | 20.1 | 19.24
824.019 pARB345-034-3 | 2.395 | 0.588313 | 0.5765 | 0.420729 | 14.4 | 15.86
824.019 pARB345-039-2 | −0.435 | 1.200667 | 0.65 | 0.767918 | 15.7 | 18.1
824.019 pARB345-041-5 | −1.43831 | 1.897436 | −0.259 | 0.690136 | 19.9 | 19.5
824.019 pARB345-044-1 | 1.4815 | 1.8109 | 3.008 | 0.95318 | 14.9 | 15.4
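The agreement between the NMR and acetyl bromide lignin values in Table 18 can be checked with a Pearson correlation coefficient. The sketch below is only an illustration using the nine value pairs transcribed from the table; the function name is hypothetical, and the specification does not state which statistic, if any, was used to judge the correlation.

```python
# Pearson correlation between two lignin measurements for the same samples.
from math import sqrt

# Values transcribed from Table 18, in row order.
NMR_LIGNIN = [14.1, 19.5, 17.0, 19.1, 20.1, 14.4, 15.7, 19.9, 14.9]
ACETYL_BROMIDE_LIGNIN = [15.83, 20.05, 16.22, 20.6, 19.24, 15.86, 18.1, 19.5, 15.4]


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    r = pearson_r(NMR_LIGNIN, ACETYL_BROMIDE_LIGNIN)
    print(f"Pearson r between NMR and acetyl bromide values: {r:.2f}")
```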

Histochemical tests for lignin, which detect coniferaldehyde units using phloroglucinol/HCl, were applied to hand sections taken from side branches of transgenic plants containing the DNA constructs of the instant invention. Phloroglucinol, also known as the Wiesner reagent, is a stain for lignin (Pomar et al., Protoplasma, 220(1-2):17-28 (2002)), and Maule stain is used to detect specifically syringyl lignin subunits (Lewis et al., Annu Rev Plant Physiol Plant Mol Biol, 41:455-496 (1990)). Transgenic plants transformed with pARB339 and pARB345 showed no observable difference from untransformed control plants. Normal-height pARB341 plants also showed no observable difference from control plants, whereas dwarf pARB341 plants had a reduced amount of phloroglucinol staining, suggesting that lignin levels were greatly reduced in these samples. Examination of stained sections of the dwarf pARB341 translines showed that there was transline-to-transline variation. Two ramets of one dwarf transline with a particularly extreme anatomical phenotype were highly consistent in their appearance, suggesting that the observed perturbations in lignin deposition and anatomy have a (trans)genetic basis. Hand-cut sections of dwarf and normal-sized pARB341 plants were also stained with Maule stain. This stain is specific for subunits of syringyl lignin (Srivastava L. M. 1966. Histochemical studies on lignin. Tappi Journal 49:173-183).

As with sections stained with phloroglucinol, there was dramatically less lignin observed in the dwarf plants than the “normal” plants and a lack of vascular differentiation in the stems of the dwarf plants was evident.

Dwarf pARB341 plants were also phenotypically different to their tall counterparts because they had wood that was a pink colour. This was observed once the stems were peeled. The stems of these plants were also soft and rubbery compared to the tall plants. Interestingly a few pARB345 plants with a tall/“normal” phenotype also had pink wood when the bark, phloem and primary cortex were peeled off.

Two wild-type samples and 10 transgenic samples were examined by confocal microscopy. The 10 transgenic samples examined included 5 pARB339 plants, one with pink wood; 2 dwarf pARB341 plants, both with pink wood; and 3 pARB345 plants, 2 of which had pink wood. Stem segments 2-3 cm long were fixed in formalin aceto-alcohol (FAA). Samples were washed in water and sectioned at a thickness of 30-60 μm using a sledge microtome. Sections were stained using safranin and phloroglucinol/HCl for anatomical analysis using the confocal microscope. Some samples were examined with toluidine blue stain.

All of the samples contained large and varying amounts of tension wood, present in patches often only on one side of the stem. This was characterized by extremely thick-walled fibres with a more or less unlignified secondary wall. In tension wood in all samples, reduction in lignification was confirmed by a reduction in red coloration by phloroglucinol/HCl, an increase in green fluorescence with safranin staining, and a pink staining with toluidine blue. To distinguish a transgenic phenotype from the tension wood effect, in all samples the areas of stem that were normal wood and did not show the staining pattern typical of tension wood were examined using confocal microscopy with safranin staining, and also using phloroglucinol/HCl staining. There were no obvious indications of altered cell wall composition in normal fibres or vessels in most of the samples. Two samples from pARB341 transgenic trees showed an anatomical phenotype indicative of altered cell wall composition: a significant reduction in vessel diameter and a wavy appearance of the vessel cell walls. At least one of these samples also showed changes outside of the xylem (lignified tissues in the pith). However, it is notable that samples from the non-dwarf, low-lignin samples identified above did not show anatomical abnormalities detectable by confocal microscopy. The results demonstrate that the constructs of the instant invention can give rise to a variety of combinations of height growth, reduced lignin content, and altered anatomical phenotype. Thus, the disclosed methods enable the generation and selection of transgenic trees that exhibit the most desirable combinations of phenotypes for pulp production or other wood-derived products.

Example 13 Evaluation of 4CL Constructs in Loblolly Pine

Lignin Evaluation Using PyMBMS

Loblolly pine (Pinus taeda) and hybrid pine (P. taeda x P. rigida) embryogenic cell lines were initiated from zygotic embryos of individual immature megagametophytes using the procedures described in U.S. Pat. No. 5,856,191, and maintained using the procedures described in U.S. Pat. No. 5,506,136.

After one to three months of culture on maintenance medium, the tissue cultures were cryopreserved, stored for periods of up to several years, and then retrieved using the methods of U.S. Pat. No. 6,682,931. Those skilled in the art of plant tissue culture will recognize that other cryopreservation and recovery protocols would be applicable to the present method and that the detail in this example may not be construed to limit the application of the method.

Uniform suspension cultures from each of the genetically different tissue culture lines were established by inoculating a 250 ml Nephelo sidearm flask (Kontes Chemistry and Life Sciences Products) with 1 g of tissue each according to the method of U.S. Pat. No. 5,491,090. The flasks containing the cells in liquid medium were placed on a gyrotory shaker at 100 rpm in a dark culture room at a temperature of 23° C.±2° C. One week later, the liquid in each flask was brought to 35 ml by pouring 15 ml fresh medium into the culture flask and swirling to evenly distribute the cells. Cell growth was measured in the sidearm by decanting cells and medium into the sidearm portion of the flasks, allowing the cells to settle for 30 minutes and then measuring the settled cell volume (SCV). When the SCV was greater than or equal to half the maximal SCV (50% of the volume of the flask was occupied by plant cells), each culture was transferred to a 500 ml sidearm flask containing a total of 80 ml cells and medium and the transferred culture was maintained under the same conditions.

To prepare for gene transfer, polyester membrane supports were sterilized by autoclaving and placed in separate sterile Buchner funnels, and for each of six replicate plates per cell line, one to three milliliters of pine embryogenic suspension was pipetted onto each support such that the embryogenic tissue was evenly distributed. The liquid medium was suctioned from the tissues and each support bearing the embryogenic tissue was placed on gelled preparation medium for Agrobacterium inoculation according to the methods described in U.S. Patent Publication No. 20020100083. Specifically, the binary constructs pWVC60, pWVC62, pWVK158, pWVK154, pWVK157, pWVK155, pWVK143, pWVC46, pWVC40, pWVC43, and pWVC44 were each introduced into different isolates of Agrobacterium tumefaciens by techniques well known to those skilled in the art, and virulence was induced with administration of acetosyringone by commonly used techniques, whereupon each of the induced Agrobacterium isolates was co-mingled with separate replicates of the plant material. The cells were co-cultivated in the dark at 22° C.±2° C. for approximately 72 hours.

Following co-cultivation, Agrobacterium was eradicated from the cultures according to the methods described in U.S. Patent Publication No. 20020100083. Cells borne on polyester membrane supports were then transferred onto fresh selection media at intervals of 2 weeks. Active growth on the selection medium occurred in a number of isolated sectors on many of the petri dishes. Such active growth in the presence of selection agent is normally an indication that the growing tissues have integrated the selection gene into their chromosomes and are stably transformed. These areas of active growth are treated as independent transformation events and are henceforth referred to as putative transgenic sublines. The putatively transgenic embryogenic tissue was multiplied by transferring growing transgenic sectors to fresh semi-solid maintenance medium supplemented with the respective selection agent.

Putatively transformed sublines, after reaching approximately 2 g, were chosen for polymerase chain reaction (PCR) amplification for verification of the presence of transgenes using standard techniques.

TABLE 19 Primer Pairs for PCR (SEQ ID NOS 68-75 respectively in order of appearance)
Target | Primer sequence | Product size
virD2 | GAA GAA AGC CGA AAT AAA GAG G | 560
virD2 | TTG AAC GTA TAG TCG CCG ATA G |
These primers were used to check contamination by Agrobacterium
NptII | AAG GAG ATA TAA CAA TGA TTG AAC AAG ATG GAT TGC | 800
NptII | TCA GAA GAA CTC GTC AAG AAG G | 800
uid(gus) | CGA AAA CGG CAA GAA AAA GCA G | 450
uid(gus) | ACG ACC AAA GCC AGT AAA GTA G |
Pal | AAT GGG AAG CCT GAG TTT ACA | 700
Pal | GGC CAG CAT GTT TTC CTC CAG |
These primers, for the PAL gene, were used as a positive control

Material from each subline also was sacrificed for GUS staining and microscopic examination. For GUS staining, an inserted uidA gene, encoding a β-glucuronidase enzyme expressed in tissue culture cells, was detected by deep blue staining of cells from each of the transgenic lines upon exposure to a colorigenic glucuronidase enzyme substrate, “X-gluc,” commercially available from Inalco, according to techniques well known in the art of plant transformation. Microscopic examination demonstrates that cell division has resumed and that transient expression of the uidA transgene displays the normal frequency for these bombardments.

Germinable embryos were produced as follows. After the cell masses that had been cultured on selection medium proliferated to at least one gram, each was separately resuspended in liquid medium again. When the cell suspensions were brought to uniform (half-maximal) SCV, equivalent amounts of suspension culture cells were pipetted onto sterile membrane supports for placement on development/maturation medium as described in U.S. Pat. No. 5,506,136 to develop high quality harvestable stage 3 (cotyledonary) embryos. Dishes were incubated in a dark growth chamber at 23° C.±2° C. The membrane supports were transferred to new petri dishes containing fresh medium every 3 weeks. At week 9, stage 3 (cotyledonary) embryos were visually analyzed for germination quality and harvested onto fabric supports on medium as described in U.S. Pat. No. 5,506,136, and incubated for about four weeks in the dark at a temperature of 4° C.±2° C. Next, embryos on their fabric supports were incubated above water in sealed containers for about three weeks in the dark at a temperature of 25° C.±2° C. Following the above two treatments, embryos on their fabric supports were transferred to germination medium and incubated for about three days in the dark at a temperature of 25° C.±2° C. Embryos were then removed from their fabric supports and placed onto the surface of fresh germination medium. Germination was conducted in the light at a temperature of 25° C.±2° C. Germination plates were examined weekly, over a period of about four weeks, and germinating embryos were transferred to MAGENTA® boxes containing 100 ml of germination medium for conversion to plantlets. MAGENTA® boxes containing developing plantlets were incubated in the light at 25° C.±2° C. for about eight to twelve weeks.

When the plantlets formed epicotyls (newly formed shoots of approximately two to four cm), they were transferred to containers filled with a potting mix [2:1:2 peat:perlite:vermiculite, containing 602 g/m³ OSMOCOTE fertilizer (18-6-12), 340 g/m³ dolomitic lime and 78 g/m³ MICRO-MAX micronutrient mixture (Sierra Chemical Co.)]. The plantlets were grown in a shaded greenhouse and misted infrequently for a period of about two weeks. They were removed from mist for acclimatization in the greenhouse for about four weeks. Plantlets were then transferred to outdoor shade for about six weeks for final acclimatization before moving to full-sun conditions. They were then grown in containers until conditions were ready for field planting.

Heights of five-month-old loblolly pine trees transformed with the RNAi vectors as noted above were measured and the results recorded (Table 20). A Duncan multiple range test performed on the height data showed that plants transformed with vectors containing the RNAi cassettes of pWVK157, pWVK155, pWVC40, pWVC43 and pWVC44 did not differ significantly in height from GUS control plants (pWVC41), whereas all other transformed lines did differ significantly in height from the controls. A single untransformed control also was measured to be 21.1 cm tall, but statistical analysis was not done with this sample as it was a single result and not an average of multiple samples. Root dry weights also were measured for all the transformed and control trees at 5 months, but no significant difference was observed between controls and transgenics.

At seven months of age approximately 200 samples were collected from the above transformed trees or control untransformed trees by cutting approximately 20 mg of tissue from each stem. Each sample was weighed in a quartz boat, and pyrolyzed in a reactor consisting of a quartz tube (2.5 cm inside diameter) with helium flowing through at 5 L/min (at STP). The reactor tube was placed such that the sampling orifice of the molecular-beam mass spectrometer was inside the end of the quartz reactor. A molecular-beam mass spectrometer using an Extrel™ Model TQMS C50 mass spectrometer was used for pyrolysis vapor analysis as described in Evans & Milne (1987) (Energy & Fuels, 1: 123-37). The reactor was electrically heated and its temperature maintained at 550° C. Total pyrolysis time was 90 seconds although the pyrolysis reaction was completed in less than 50 seconds. The residence time of the pyrolysis vapors in the reactor pyrolysis zone has been estimated to be ~75 ms and is short enough that secondary cracking reactions in the quartz reactor are minimal. Mass spectral data from 20-450 Da were acquired on a Teknivent Vector 2™ data acquisition system using 22 eV electron impact ionization. Using this system, both light gases and heavy tars are sampled simultaneously and in real time. The mass spectrum of the pyrolysis vapor provides a rapid, semiquantitative depiction of the molecular fragments.

Duplicate mass spectra of the loblolly pine sample set and standards were collected on two successive days in a block fashion so as to mitigate problems associated with data analysis that could arise from day to day spectrometer drift. A combined analysis of the mass spectra collected on both days indicated that minimal spectrometer drift occurred.

Examination of the spectra determined that mass spectra of the transgenic samples are different from the controls. Examples of the pyMBMS spectra of the pyrolysis products from a transgenic and a control loblolly pine sample are shown in FIG. 14.

Principal component analysis of loblolly pine pyMBMS spectra using a mass range between m/z 50 and 200 highlighted pyrolysis products from lignin and carbohydrates while minimizing small pyrolysis and electron impact fragments (below m/z 50) and extractives (above m/z 200). By selecting a mass range that contained more information about lignin and less about the extractives, it became clear that there were significant differences between the constructs. FIG. 15A shows a scatter plot of PC1 scores versus PC2 scores of mass spectra collected using a mass range of m/z 50-200 for all the transgenics analyzed. From this scatter plot we can conclude that plants transformed with some vectors show clear separations to control untransformed plants due to differences in the amount of lignin as determined from the analysis of mass spectra and PC loadings, while others do not. FIGS. 15B, 16A and 16B provide additional insights. Trees transformed with pWVC41 were GUS control transgenics and showed no difference from the control untransformed trees. Trees transformed with pWVC40 and pWVK154 both contained the pine 4CL fragment D coding sequence (SEQ ID NO: 21) and trees transformed with pWVC46 and pWVK158 both contained the pine 4CL fragment C (SEQ ID NO: 20) coding sequence. Each of these transformants separated from the control samples on the scatter plots, indicating a difference in the amount of lignin between the transgenics and controls.
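The principal component analysis described above can be illustrated with a minimal sketch. It assumes the spectra are held in an array with one row per sample and one column per m/z value, restricts them to the m/z 50-200 window, normalises each spectrum to its summed intensity, mean-centres the variables, and obtains PC scores by singular value decomposition. The function name `pc_scores`, the normalisation step, and the synthetic spectra are assumptions made for illustration; the lignin-related m/z values are taken from Table 21.

```python
import numpy as np

def pc_scores(spectra, mz, mz_min=50, mz_max=200, n_components=2):
    """Return PC scores, loadings and retained m/z values for a spectral window."""
    spectra = np.asarray(spectra, dtype=float)
    mz = np.asarray(mz)
    window = (mz >= mz_min) & (mz <= mz_max)
    X = spectra[:, window]
    # Normalise each spectrum to its summed intensity in the window (assumed step).
    X = X / X.sum(axis=1, keepdims=True)
    # Mean-centre every m/z variable before decomposition.
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = U[:, :n_components] * s[:n_components]
    loadings = Vt[:n_components].T
    return scores, loadings, mz[window]

# Synthetic demonstration: three "high-lignin" and three "low-lignin" spectra.
rng = np.random.default_rng(0)
mz = np.arange(20, 451)
base = rng.uniform(1.0, 2.0, size=mz.size)
lignin_peaks = np.isin(mz, [124, 137, 138, 150, 164, 178, 180])
spectra = np.tile(base, (6, 1)) + rng.normal(0.0, 0.01, size=(6, mz.size))
spectra[:3, lignin_peaks] += 5.0  # hypothetical extra lignin signal

scores, loadings, kept_mz = pc_scores(spectra, mz)
print(scores[:, 0])  # PC1 separates the two synthetic groups by sign
```

In this toy case the PC1 loadings are dominated by the lignin-derived masses, which is the same reasoning the text uses to interpret PC1 as a lignin axis.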

FIG. 17 shows an expanded mass spectrum region of the samples selected in FIG. 16A: the control and the transgenics pWVC40 and pWVK154. It is clear that the peaks arising from the pyrolysis of lignin are decreasing with respect to other peaks that can be assigned to carbohydrates and extractives (see Table 21). Similar analysis of the mass spectra of the other constructs indicates that PC1 reflects the concentration of lignin in each sample. Samples to the right in FIGS. 15-16 have the highest lignin content and samples to the left have much lower lignin content.

Seven-month-old loblolly pine trees transformed with pWVK158, pWVK154, pWVC46 and pWVC40 showed the greatest reduction in lignin content when compared to untransformed controls and GUS transformed controls. Trees transformed with pWVK158, pWVK154 and pWVC42 were significantly shorter than untransformed and GUS transformed trees, whereas trees transformed with pWVC40 had a significant lignin reduction but no significant height reduction.

Lignin Evaluation Using Nuclear Magnetic Resonance Spectroscopy

High-resolution, solid-state ¹³C NMR spectra were collected at 4.7 T with cross-polarization (CP) and magic angle spinning (MAS) in a Bruker Avance 200 MHz spectrometer. Variable amplitude cross-polarization (1 dB linear ramp over the cross-polarization period) was used to minimize variations of the nonprotonated aromatic carbons that are sensitive to Hartmann-Hahn mismatch at higher MAS rotation rates (S. O. Smith, I. Kustanovich, X. Wu, O. B. Peersen, Journal of Magnetic Resonance (1994) 104: 334-339). ¹H and ¹³C fields were matched at 53.6 kHz and a 1 dB ramp was applied to the proton r.f. during the matching period. Acquisition time was 0.033 seconds and sweep width was 31.3 kHz. Magic-angle spinning was performed at a rate of 7000 Hz. 2000-4000 scans were averaged using a 2 ms contact time and a pulse repetition rate of 1.0 sec. Differences observed in relative peak intensities and integrated areas can be used to identify differences between similar samples. Weight % lignin values were calculated from the integrated areas of the aromatic (110 ppm-160 ppm) and carbohydrate (40 ppm-100 ppm) regions using the method of Haw et al. 1984 (J. F. Haw, G. E. Maciel, H. A. Schroder, Analytical Chemistry 56: 1323).
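The region integration that feeds the weight % lignin calculation can be sketched as follows. This is only an illustration of integrating the aromatic (110-160 ppm) and carbohydrate (40-100 ppm) regions and forming their area ratio. It does not reproduce the method of Haw et al., which additionally relies on assumptions about lignin and carbohydrate structures to convert areas to weight %. The function names and the synthetic spectrum are hypothetical.

```python
import numpy as np

def region_area(ppm, intensity, lo, hi):
    """Trapezoidal integral of intensity over lo <= ppm <= hi."""
    ppm = np.asarray(ppm, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    order = np.argsort(ppm)              # spectra are often stored high-to-low ppm
    ppm, intensity = ppm[order], intensity[order]
    mask = (ppm >= lo) & (ppm <= hi)
    x, y = ppm[mask], intensity[mask]
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def aromatic_fraction(ppm, intensity):
    """Aromatic area divided by aromatic plus carbohydrate area (not weight %)."""
    aromatic = region_area(ppm, intensity, 110.0, 160.0)
    carbohydrate = region_area(ppm, intensity, 40.0, 100.0)
    return aromatic / (aromatic + carbohydrate)

# Synthetic spectrum: flat intensity 1 in the aromatic region, 2 in the carbohydrate region.
ppm = np.arange(0.0, 201.0, 1.0)
intensity = np.zeros_like(ppm)
intensity[(ppm >= 110) & (ppm <= 160)] = 1.0
intensity[(ppm >= 40) & (ppm <= 100)] = 2.0

fraction = aromatic_fraction(ppm, intensity)
print(round(fraction, 4))
```

Comparing this fraction between a control and a transgenic line gives a relative measure; absolute weight % values require the structural assumptions of the cited method.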

Twelve samples were selected based on their PC1 scores and the lignin content was determined using solid-state ¹³C NMR. In some cases, several samples from the same line were combined in order to get a sample that was large enough for the NMR analysis. FIG. 18 shows a comparison of the NMR spectra of a control line (two samples combined) and a transformed line pWVK154 (four samples combined). The NMR spectra confirmed the results of the pyMBMS analysis that pWVK154 transgenics had a much lower lignin content than the control line. The weight % lignin was determined by integration of the aromatic and carbohydrate regions combined with some assumptions of the lignin and carbohydrate structures (see Haw et al., (1984) Analytical Chemistry, 56: 1323). The results for the 12 selected samples are given in Table 22. Comparison of the NMR wt % lignin values with the PC1 scores for the selected samples shows that the PC1 scores accurately reflect the amount of lignin in the loblolly pine samples and that the PC1 scores can be used to rank the lignin content of the different constructs.

Lignin Evaluation Using Multivariate Data Analysis

Data analysis was performed using the Unscrambler version 7.8 software program (CAMO A/S, Trondheim, Norway). The Projection to Latent Structure (PLS-1) algorithm, which handles only one Y-variable at a time, was used to construct the model for predicting the lignin contents of the pine samples. The lignin content predictive model was developed using the pyMBMS spectra as the X-matrix (310 variables (m/z values between 50 and 360)) and the lignin values measured by solid-state NMR as the Y-matrix. The mass spectra were normalized to the total ion current before analysis. Model validation was performed using full cross validation, which systematically removes one sample from the data, establishes a model with the remaining samples and then uses that model to predict the value of the Y-variable of the sample that was removed from the data set. The process continues until all samples have been removed and predicted from the Y-matrix. The goodness-of-fit (i.e., a high correlation coefficient) and minimal residual error were the criteria used for choosing the best model.

A PLS1 model to predict lignin content was constructed from the NMR lignin values and the pyMBMS spectra. In cases where more than one tree from the same line was sampled for the NMR analysis, the corresponding mass spectra from the trees were averaged and used to build the model. A PLS model was constructed using a range of m/z values from 50 to 360. This range was determined empirically to provide the best model based on the correlation coefficient of the fully cross-validated model. The final fully cross-validated model, shown in FIG. 19, had an RMSEP of 0.9 and an r² value of 0.94.
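The full (leave-one-out) cross validation of a PLS1 model described above can be sketched as follows. This is a minimal illustration using a textbook NIPALS PLS1 and synthetic data. It is not the Unscrambler implementation, and the function names, the number of components and the toy X and y arrays are assumptions; the real model used pyMBMS spectra as X and NMR lignin values as y.

```python
import numpy as np

def pls1_fit(X, y, n_components):
    """Fit PLS1 by NIPALS; return (x_mean, y_mean, regression coefficients)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xr, yr = X - x_mean, y - y_mean            # residual (deflated) blocks
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr                          # weights: covariance of X with y
        norm = np.linalg.norm(w)
        if norm < 1e-12:                       # nothing left to explain
            break
        w = w / norm
        t = Xr @ w                             # scores
        tt = t @ t
        p = Xr.T @ t / tt                      # X loadings
        qk = (yr @ t) / tt                     # y loading
        Xr = Xr - np.outer(t, p)               # deflate X
        yr = yr - qk * t                       # deflate y
        W.append(w); P.append(p); q.append(qk)
    if not W:
        return x_mean, y_mean, np.zeros(X.shape[1])
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)
    return x_mean, y_mean, coef

def pls1_predict(model, X):
    x_mean, y_mean, coef = model
    return y_mean + (np.asarray(X, dtype=float) - x_mean) @ coef

def full_cross_validation(X, y, n_components):
    """Leave each sample out once, predict it, and report RMSEP and r squared."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    preds = np.empty_like(y)
    for i in range(len(y)):
        keep = np.arange(len(y)) != i
        model = pls1_fit(X[keep], y[keep], n_components)
        preds[i] = pls1_predict(model, X[i:i + 1])[0]
    rmsep = float(np.sqrt(np.mean((y - preds) ** 2)))
    r2 = float(np.corrcoef(y, preds)[0, 1] ** 2)
    return preds, rmsep, r2

# Synthetic check: y is an exact linear function of three X variables.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(12, 3))
y_demo = 20.0 + X_demo @ np.array([3.0, -2.0, 1.0])
preds, rmsep, r2 = full_cross_validation(X_demo, y_demo, n_components=3)
print(rmsep, r2)
```

With real spectra the number of components would be chosen by comparing the cross-validated RMSEP and correlation across candidate models, in line with the selection criteria stated in the text.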

The lignin level was determined for each of the transformed lines using an NMR-based model developed by the National Renewable Energy Laboratory (Golden, Colo.). Table 20 shows the percentage of lignin compared to non-transformed controls for each of the RNAi constructs. All of the transformants showed reduced lignin relative to control plants, though different lines possessed different amounts of lignin. Transformants comprising constructs with fragments C or D showed the most lignin reduction.

TABLE 20 Effect of RNAi constructs on lignin level
Percentage of lignin relative to non-transformed controls
RNAi fragment | A | B | C | D | E | F
4CL promoter | 78.4 | na | 66.4 | 76.3 | 91.5 | 91.2
SUBQ promoter | 85.5 | 79.2 | 74.2 | 62.5 | 94.0 | 98.6
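As a worked reading of Table 20, the percentage of lignin relative to controls can be converted to a percentage reduction (100 minus the tabulated value) and ranked. The sketch below uses only the values in the table; the dictionary layout and variable names are illustrative choices, and the "na" entry (fragment B with the 4CL promoter) is omitted.

```python
# Percentage of lignin relative to non-transformed controls, from Table 20.
lignin_pct = {
    ("4CL", "A"): 78.4, ("4CL", "C"): 66.4, ("4CL", "D"): 76.3,
    ("4CL", "E"): 91.5, ("4CL", "F"): 91.2,
    ("SUBQ", "A"): 85.5, ("SUBQ", "B"): 79.2, ("SUBQ", "C"): 74.2,
    ("SUBQ", "D"): 62.5, ("SUBQ", "E"): 94.0, ("SUBQ", "F"): 98.6,
}

# Reduction relative to controls, in percentage points.
reduction = {key: round(100.0 - value, 1) for key, value in lignin_pct.items()}

# Rank constructs from largest to smallest lignin reduction.
ranked = sorted(reduction.items(), key=lambda item: item[1], reverse=True)
for (promoter, fragment), pct in ranked:
    print(f"{promoter} promoter, fragment {fragment}: {pct}% reduction")
```

The top of the ranking is occupied by fragments C and D, consistent with the statement that these fragments showed the most lignin reduction.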

FIG. 10 provides a graph showing the lignin values obtained for each transformant. The constructs are listed in order of average height in the x-axis. Accordingly, the results show that in pine, fragments C and D were associated with an average reduction in growth as well as lignin. Fragment E did not reduce growth, but also did not reduce lignin much. The best lignin reduction that was unaccompanied by an average growth reduction was seen with Fragment A (driven by either promoter) or with Fragment F (driven by 4CL promoter). These constructs constitute the appropriate phenotype for forestry applications.

Table 21 provides mass spectrum peak assignments associated with pyrolysis molecular beam mass spectroscopy of loblolly pine wood samples (Evans et al, Energy & Fuels, 1:123-137 (1987)).

TABLE 21
m/z | Assignment
57, 73, 85, 96, 114, 96 | C5 sugars
57, 60, 73, 98, 126, 144 | C6 sugars
94 | Phenol
110 | catechol, resorcinol
120 | Vinylphenol
122 | Ethylphenol
124 | Guaiacol
137¹ | ethylguaiacol, homovanillin, coniferyl alcohol
138 | Methylguaiacol
150 | Vinylguaiacol
164 | allyl-+propenyl guaiacol
178 | coniferyl aldehyde
180 | coniferyl alcohol, syringylethene
272 | G-G lignin dimer
285¹ | Dehydroabietic acid
300 | Dehydroabietic acid
302 | abietic acid
¹fragment ion.

TABLE 22 Weight % lignin values determined by NMR.
Line transformed with which construct | NMR-determined weight % lignin
pWVK154 | 16
pWVC46 | 17
pWVC46 | 19
pWVK143 | 21
pWVC60 | 21
pWVC44 | 23
pWVC60 | 24
pWVC40 | 24
pWVK157 | 25
pWVC43 | 27
pWVC44 | 28
Untransformed Control | 29

Example 14 Field Test of Pine Transformants

Four to eight genetically identical propagules (ramets) were rooted from each of 122 lines for field planting, comprising approximately equal numbers of lines for each of the 16 constructs, for a total of approximately 1000 treestocks planted in a randomized block design. Lines transformed with 4CL promoter-driven constructs and superubiquitin promoter-driven constructs were planted in separate blocks of approximately 500 treestocks each with respective controls.

Constructs identified with an asterisk in Table 23 yielded at least some dwarfed transformants. As evident from the table, transformants with superubiquitin promoter-driven constructs were more likely to show dwarfing. Meanwhile, transformants with 4CL promoter-driven constructs were more likely to show reduced lignin without significant dwarfing, as can be seen in Table 23 below, in which Duncan's multiple range test was applied to height measurements. In Table 23, it can be observed that the transformants containing constructs driven by the vascular-preferred promoter are predominantly represented in the larger height class. Accordingly, constructs with tissue-preferred promoters are preferred.

TABLE 23 4CL RNAi-transformed and control trees planted in field test. Ranked by average heights (measured at age 8 months) and root masses (measured at age 12 months, i.e. at time of planting into field sites) of transgenic trees
Promoter | RNAi fragment of the 4CL gene | Some events showed dwarfing | Height (cm) | Duncan group height | Root mass (g dry wt) | Duncan group roots
4CL | GUS | | 21.4 | a | 2.31 | ab
4CL | frag E4CL | | 19.1 | ab | 2.29 | ab
SUBQ | frag F4CL | | 18.9 | a | 2.47 | a
4CL | frag F4CL | | 17.6 | ab | 2.3 | ab
4CL | frag D4CL | | 17.2 | ab | 2.16 | ab
SUBQ | frag E4CL | | 16.5 | ab | 1.91 | b
4CL | frag A4CL | | 15.6 | bc | 2.25 | ab
4CL | frag C4CL | * | 12.5 | cd | 1.93 | ab
SUBQ | frag A4CL | * | 12.5 | cd | 2.25 | ab
SUBQ | frag C4CL | * | 11.4 | d | 1.85 | b
SUBQ | frag D4CL | * | 10 | de | 1.84 | b
SUBQ | frag B4CL | * | 7.7 | e | 2.13 | ab
Duncan's multiple range test was performed on the height and root mass statistics

Example 15 Evaluation of Carbohydrate Levels

Secondary xylem (wood) is composed primarily of cellulose (a linear polymer of glucose), hemicelluloses (heteropolysaccharides found in association with cellulose; in gymnosperms the principal component sugar is mannose) and lignin (a phenolic polymer that cannot be depolymerized by hydrolysis). The varying levels of carbohydrates (CHOs) and lignin can affect the usefulness of the tree in processes such as pulping. Cellulose is the principal component of pulp yield, and yield may also be affected by the amount and type of hemicellulose associated with the cellulose. Additionally, the cellulose content of wood is positively correlated with strength, which is important for both pulp-derived and solid wood products.

Harding et al. (1999) (Nat Biotechnol. 17(8):808-12) found that transgenic aspen trees with reduced lignin levels showed elevated CHO levels. Harding et al. claim that the elevation of CHO levels may be responsible for the preservation of plant structural integrity of trees with reduced lignin levels, and that such trees will show enhanced utility for pulping.

Transgenic plant material tested for total lignin amounts can be tested for carbohydrates (CHOs), as a measure of the amount of cellulose and hemicellulose present. Carbohydrate analysis is carried out on extractive-free, ground samples. These samples are hydrolyzed in two stages with 72% sulphuric acid, first by incubation at room temperature for ½ hour, followed by incubation at 120° C. for 1 hour, then decanted and analyzed by ion chromatography. From the chromatograms the percent dry wood weight (DWW) of arabinan, galactan, glucan, xylan and mannan is determined.
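One common way to express chromatographic sugar results as polymer content is to apply an anhydro correction, since each sugar unit in a polysaccharide has lost one water molecule relative to the free monosaccharide. The sketch below illustrates that arithmetic. The source does not state how the percent DWW values are computed, so the correction, the function name, and the example masses are assumptions, and losses from sugar degradation during hydrolysis are ignored.

```python
WATER_G_PER_MOL = 18.015

# Molar mass of the free monosaccharide released from each polymer (g/mol).
MONOSACCHARIDE_G_PER_MOL = {
    "arabinan": 150.13,  # arabinose, a pentose
    "xylan": 150.13,     # xylose, a pentose
    "galactan": 180.16,  # galactose, a hexose
    "glucan": 180.16,    # glucose, a hexose
    "mannan": 180.16,    # mannose, a hexose
}

def polymer_percent_dww(sugar_mg, sample_dry_mg):
    """Convert measured monosaccharide masses to polymer % of dry wood weight."""
    result = {}
    for polymer, mg in sugar_mg.items():
        molar_mass = MONOSACCHARIDE_G_PER_MOL[polymer]
        anhydro_factor = (molar_mass - WATER_G_PER_MOL) / molar_mass
        result[polymer] = 100.0 * mg * anhydro_factor / sample_dry_mg
    return result

# Hypothetical measurement: mg of each free sugar recovered from 100 mg dry wood.
example = polymer_percent_dww({"glucan": 50.0, "xylan": 10.0}, sample_dry_mg=100.0)
for polymer, pct in example.items():
    print(f"{polymer}: {pct:.2f}% DWW")
```

The factors work out to roughly 0.90 for hexoses and 0.88 for pentoses.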

Hu et al. (1999) (Nature Biotechnology 17: 808-812) demonstrated that transgenic aspen trees downregulating the 4CL gene exhibited up to a 45% reduction in lignin content and a 15% increase in cellulose content. Assessing carbohydrate levels of transgenic trees tested for lignin in Example 15 will determine whether these constructs show a correlation between decreasing lignin content and increasing cellulose content.

The results from CHO determinations of transgenic trees demonstrate which constructs are correlated with changes to cellulose or hemicellulose content in transformed trees. These results demonstrate that these constructs are enabled to modulate the cellulose content correlated with pulp yield and with strength of pulp fibers and solid wood products.

The constructs alter the cellulose or hemicellulose content in transformed trees. The reduction in lignin levels and increase in CHO levels of transformed trees provide economic and environmental advantages to the pulp industry. In particular, the reduction of lignin content should lead to a reduction of chemicals in pulping and bleaching processes.

Example 16 Additional Methods for Analyzing Lignin Content

In this example, anatomical analysis of older samples of genetic clones of trees examined previously in Example 13 is done in order to compare cell structure and lignin content in transgenic plants between plants of 6 months of age and plants of approximately 18 months of age. Additionally, transgenic plant material tested for total lignin amounts, CHO amounts and micro-pulped in Examples 11 and 13 respectively is examined by confocal microscopy to look at the cell structure present.

Samples are fixed in formalin aceto-alcohol (FAA). Samples are washed in water and sectioned at a thickness of 30-60 μm using a sledge microtome. Sections are stained using safranin and examined using a confocal microscope.

A histochemical test for lignin, which detects coniferaldehyde units using phloroglucinol/HCl, also is applied to the samples. Some samples are also examined with toluidine blue stain as an additional stain for lignin. This anatomical analysis identifies the amount of reaction wood present and whether wood (xylem) cells of transgenic plants display any differences with respect to control plants.

These results demonstrate that the cell structure of transgenic trees shown to have reduced lignin levels in Examples 12 and 13, but showing normal morphology, has no significant differences from that of non-transgenic trees with “normal”/higher lignin levels. These results further demonstrate that the cell structure observed in 6 month old trees is consistent with observations in samples from 18 month old trees.

Example 17 Processing of Trees with Reduced Lignin

To determine whether reduced lignin content translates to improvements in the pulping process, the transgenic trees of the examples can be subjected to micro-pulping. Important parameters for determining the suitability of a wood resource for kraft pulping are pulp yield, pulping rate, alkali consumption, fibre qualities and pulp bleachability. Wood samples are air dried, chipped and then oven dried at 105° C. for at least two days and until a constant weight is reached. Kraft pulping is performed in 150 mL stainless steel reactors attached to the rotating arm of a Stalsvets multi-digester pulping unit (Stalsvets, Sweden). The reactors are rotated through a polyethylene bath heated by electric heaters having a total capacity of 12.5 kW and controlled by an Omron controller (Omron Corporation, Illinois, USA). Typical pulping conditions are:

Effective alkali charge: 14% (as Na₂O)
Liquor sulphidity: 30%
Liquor:wood ratio: 6:1
Maximum pulping temperature: 170° C.
Time to maximum temperature: 90 minutes
H-factor: Determined by varying the time at 170° C.
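The relationship between time at 170° C. and H-factor can be sketched numerically. The H-factor is conventionally defined as the time integral (in hours) of a relative reaction rate, exp(43.2 − 16115/T) with T in kelvin, which equals about 1 at 100° C. The source does not give its formula or heating profile, so that conventional rate expression, the linear ramp, and the 25° C. starting temperature are assumptions, as are the function names.

```python
import math

def relative_rate(temp_c):
    """Relative delignification rate in the conventional H-factor definition."""
    return math.exp(43.2 - 16115.0 / (temp_c + 273.15))

def h_factor(hold_minutes, start_c=25.0, max_c=170.0,
             ramp_minutes=90.0, step_minutes=0.5):
    """Integrate the relative rate over a linear ramp plus a hold at max_c."""
    total_minutes = ramp_minutes + hold_minutes
    n_steps = int(round(total_minutes / step_minutes))
    h = 0.0
    for i in range(n_steps):
        t_mid = (i + 0.5) * step_minutes          # midpoint of the time step
        if t_mid < ramp_minutes:
            temp = start_c + (max_c - start_c) * t_mid / ramp_minutes
        else:
            temp = max_c
        h += relative_rate(temp) * step_minutes / 60.0   # rate x hours
    return h

# H-factor for several hypothetical hold times at the maximum temperature.
for hold in (0, 30, 60, 90):
    print(hold, "min at 170 C ->", round(h_factor(hold)))
```

Because the rate is constant during the hold, each additional hour at 170° C. adds `relative_rate(170.0)` to the H-factor, so a target H-factor can be reached by adjusting the hold time alone.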

Those skilled in the art of pulp manufacture will recognize that many other combinations of micropulping conditions are available to test the pulpability of the wood of the trees of the instant invention. The reactors are quenched in cold water, and the cooked chips filtered off on a Buchner funnel. The filtrate is retained for residual alkali analysis. The cooked chips are washed extensively with tap water and then blended for 15 minutes in a standard British disintegrator. The resulting pulp is filtered on a Buchner funnel and washed with water until the filtrate is clear. The pulp pad is dried overnight at 60° C., and total yield determined by weighing.

Residual alkali is determined by titration with 0.5M hydrochloric acid to the first inflection point (Milanova, E. and Dorris, G. M., Nordic Pulp and Paper Research Jl., 9(1), 4-9 (1994)). Alkali consumption is the difference between the effective alkali charge on chips and residual alkali in the black liquor, expressed as a percentage of oven-dry chips (as Na₂O).
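The alkali consumption arithmetic can be illustrated with a short sketch. It assumes that an aliquot of black liquor is titrated and that the result is scaled to the whole liquor volume; the only chemistry used is that one mole of HCl neutralises alkali equivalent to half a mole of Na₂O (61.98 g/mol). The cited titration procedure is not reproduced here, and the function names, argument names and example volumes are hypothetical.

```python
NA2O_G_PER_MOL = 61.98

def residual_alkali_pct(titrant_ml, aliquot_ml, liquor_total_ml,
                        chips_od_g, hcl_molarity=0.5):
    """Residual alkali as % Na2O on oven-dry chips, from an HCl titration."""
    mol_hcl = hcl_molarity * titrant_ml / 1000.0
    # One mole of HCl corresponds to half a mole of Na2O.
    g_na2o_in_aliquot = mol_hcl * NA2O_G_PER_MOL / 2.0
    # Scale from the titrated aliquot to the whole liquor volume (assumed step).
    g_na2o_total = g_na2o_in_aliquot * liquor_total_ml / aliquot_ml
    return 100.0 * g_na2o_total / chips_od_g

def alkali_consumption_pct(effective_alkali_charge_pct, residual_pct):
    """Alkali consumed, as % Na2O on oven-dry chips."""
    return effective_alkali_charge_pct - residual_pct

# Hypothetical cook: 10 g oven-dry chips, 60 mL liquor (6:1), 14% charge.
residual = residual_alkali_pct(titrant_ml=2.0, aliquot_ml=10.0,
                               liquor_total_ml=60.0, chips_od_g=10.0)
consumed = alkali_consumption_pct(14.0, residual)
print(round(residual, 4), round(consumed, 4))
```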

Pulp kappa number is determined by a half scale modification of Appita Standard 201m-86 (AS/NZS 1301.201s:2002). The pulping rate is calculated as the kappa number reached for a given cooking time.

Pulp bleachability is determined by bleaching pulps at 10% consistency using a D-Eo-D sequence (Kibblewhite et al., Appita, 51(2), 1145-121 (1998)) as follows: D stage: 0.25 active chlorine multiple, 100% industrial chlorine dioxide, 50° C., 60 minutes. Eo stage: 2% NaOH, 0.25 MPa O₂, 70° C., 60 minutes. D stage: 1% ClO₂, 70° C., 180 minutes. Following bleaching, 5 g brightness pads are prepared at pH 4-5.5, and brightness is determined after equilibration at 23° C./50% RH using an L & W Elrepho (Lorentzen & Wettre, Kista, Sweden). Fiber qualities such as average fiber length, width, and lumen size and standard deviations are analyzed using a Kajaani FiberLab system (Metso Automation, Kajaani, Finland).

The results are correlated to the type of construct used in the transformation and demonstrate that the constructs effectively modulate the suitability of the wood resources for kraft pulping.

Table 24 provides the nucleic acid sequences of the polynucleotides and DNA constructs described herein.
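As a quick consistency check on the listed sequences, the sketch below computes reverse complements and confirms that the two Pr4CL intron primers (SEQ ID NOs: 10 and 11) match the ends of the Pr4CL intron (SEQ ID NO: 9). The helper names are hypothetical, and reading the leading CTCGAG of each primer as an added XhoI cloning site is an assumption; only the first and last 25 nucleotides of the intron are copied from the table.

```python
# Illustrative sketch; helper names are hypothetical.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_inverted_repeat(first_segment, second_segment):
    """True if the second segment is the reverse complement of the first,
    as expected for the two arms flanking the spacer in a hairpin construct."""
    return reverse_complement(first_segment).upper() == second_segment.upper()


XHOI_SITE = "CTCGAG"                        # XhoI recognition sequence
INTRON_5P = "CAGGTCAGTAATCTTAACTTCCCTT"     # first 25 nt of SEQ ID NO: 9
INTRON_3P = "CCCAATCATTTAACCTTTTCAGGGC"     # last 25 nt of SEQ ID NO: 9
PRIMER_10 = "CTCGAGCAGGTCAGTAATCTTAACTTCCCTT"
PRIMER_11 = "CTCGAGGCCCTGAAAAGGTTAAATGATTGGG"

if __name__ == "__main__":
    # The forward primer is the site plus the 5' end of the intron.
    print(PRIMER_10 == XHOI_SITE + INTRON_5P)
    # The reverse primer is the site plus the reverse complement of the 3' end.
    print(PRIMER_11 == XHOI_SITE + reverse_complement(INTRON_3P))
```

The same `is_inverted_repeat` helper could be applied to the sense and antisense arms of a complete cassette once the arms have been extracted from the full sequence.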

TABLE 24
Seq ID  Description  Sequence
1  Linkers used for backbone production  AATTCGTCCAGCAGTTGTCTGGAGCTCCACCAGAAATCTGGA
2  Linkers used for backbone production  AGCTTCCAGATTTCTGGTGGAGCGCCAGACAACTGCTTGACG
3  Primer for P. radiata SuperU 3′UTR  AGCTGAGCTCGGGTGTTATTTGTGGATAATAAATTCGGG
4  Primer for P. radiata SuperU 3′UTR  GTTATGGTAAAGCAAATTATATTTCTGAGACAATAGGCACTCGAGTCGA
5  Primer for 3′ UTR and nos terminator fragment of pBI-121  AAAATCGATGGGTGTTATTTGTGGATAATAAATTCGGG
6  Primer for 3′ UTR and nos terminator fragment of pBI-121  GGTACCATTTAAATGCGGCCGCGATCTAGTAACATAGATGACACC
7  Primers for P. radiata SuprU promoter  AAATCTAGAGGTACCATTTAAATGCGGCCGCAAAACCCCTCACAAATACATAA
8  Primers for P. radiata SuprU promoter  TTTCTGCAGCTTGAAATTGAAATATGACTAACGAAT
9  Intron Sequence Pr4CL  CAGGTCAGTAATCTTAACTTCCCTTTTGAAAACTCTTAAGAATGAAAATTTATCTTAAATTTAGAAAC TTTGGCTGATCTTTCGAAAATCTGCTAAATTTTTTGGAACCTTGGCCGATCTTTTAAAAATATGCGAA TTCTTTTAGCAATCTACAAATCTTTTTAAAATATATAATTGAAAATCTGCTAAATTTGTTGGAACCTTG ACTGTTCTTTTTAAAATATGCAAATTCTTTTAGCAACTTGCAAATTCTTTAGCAATCTACAAATCTTTT TAAAACATATAAATGAAAATGGACCAATTTTTCTAGCCCCTAAATTTTTTCTAGCCCCTTGCTTTTCCT TCCAAATACCCTACCTAATTTTGCATCTAACAGGCCCAATCATTTAACCTTTTCAGGGC
10  Primers to amplify Pr4CL intron oARB625  CTCGAGCAGGTCAGTAATCTTAACTTCCCTT
11  Primers to amplify Pr4CL intron oARB626  CTCGAGGCCCTGAAAAGGTTAAATGATTGGG
12  Primers for P. radiata cDNA clone  GAATTCCTGCAGAAGCTTATCCTTGGGCAGGGATACGGCATGAC
13 Primers for P. 
radiata GAATTCCTGCAGAAGCTTGATTAGCAGGATCCACCTGGAAGCCTTTATATTG cDNA clone 14 Complete RNAi casette GGCCGCAAAACCCCTCACAAATACATAAAAAAAATTCTTTATTTAATTATCAAACTCTCCACTACCTT for pARB513 TCCCACCAACCGTTACAATCCTGAATGTTGGAAAAAACTAACTACATTGATATAAAAAAACTACATTA CTTCCTAAATCATATCAAAATTGTATAAATATATCCACTCAAAGGAGTCTAGAAGATCCACTTGGACA AATTGCCCATAGTTGGAAAGATGTTCACCAAGTCAACAAGATTTATCAATGGAAAAATCCATCTACC AAACTTACTTTCAAGAAAATCCAAGGATTATAGAGTAAAAAATCTATGTATTATTAAGTCAAAAAGA AAACCAAAGTGAACAAATATTGATGTACAAGTTTGAGAGGATAAGACATTGGAATCGTCTAACCAGG AGGCGGAGGAATTCCCTAGACAGTTAAAAGTGGCCGGAATCCCGGTAAAAAAGATTAAAATTTTTTT GTAGAGGGAGTGCTTGAATCATGTTTTTTATGATGGAAATAGATTCAGCACCATCAAAAACATTCAG GACACCTAAAATTTTGAAGTTTAACAAAAATAACTTGGATCTACAAAAATCCGTATCGGATTTTCTCT AAATATAACTAGAATTTTCATAACTTTCAAAGCAACTCCTCCCCTAACCGTAAAACTTTTCCTACTTCA CCGTTAATTACATTCCTTAAGAGTAGATAAAGAAATAAAGTAAATAAAAGTATTCACAAACCAACAA TTTATTTCTTTTATTTACTTAAAAAAACAAAAAGTTTATTTATTTTACTTAAATGGCATAATGACATAT CGGAGATCCCTCGAACGAGAATCTTTTATCTCCCTGGTTTTGTATTAAAAAGTAATTTATTGTGGGGT CCACGCGGAGTTGGAATCCTACAGACGCGCTTTACATACGTCTCGAGAAGCGTGACGGATGTGCGAC CGGATGACCCTGTATAACCCACCGACACAGCCAGCGCACAGTATACACGTGTCATTTCTCTATTGGAA AATGTCGTTGTTATCCCCGCTGGTACGCAACCACCGATGGTGACAGGTCGTCTGTTGTCGTGTCGCGT AGCGGGAGAAGGGTCTCATCCAACGCTATTAAATACTCGCCTTCACCGCGTTACTTCTCATCTTTTCT CTTGCGTTGTATAATCAGTGCGATATTCTCAGAGAGCTTTTCATTCAAAGGTATGGAGTTTTGAAGGG CTTTACTCTTAACATTTGTTTTTCTTTGTAAATTGTTAATGGTGGTTTCTGTGGGGGAAGAATCTTTTG CCAGGTCCTTTTGGGTTTCGCATGTTTATTTGGGTTATTTTTCTCGACTATGGCTGACATTACTAGGGC TTTCGTGCTTTCATCTGTGTTTTCTTCCCTTAATAGGTCTGTCTCTCTGGAATATTTAATTTTCGTATGT AAGTTATGAGTAGTCGCTGTTTGTAATAGGCTCTTGTCTGTAAAGGTTTCAGCAGGTGTTTGCGTTTT ATTGCGTCATGTGTTTCAGAAGGCCTTTGCAGATTATTGCGTTGTACTTTAATATTTTGTCTCCAACCT TGTTATAGTTTCCCTCCTTTGATCTCACAGGAACCCTTTCTTCTTTGAGCATTTTCTTGTGGCGTTCTG TAGTAATATTTTAATTTTGGGCCCGGGTTCTGAGGGTAGGTGATTATTCACAGTGATGTGGTTTCCCT ATAAGGTCCTCTATGTGTAAGCTGTTAGGGTTTGTGCGTTACTATTGACATGTCACATGTCACATATT TTCTTCCTCTTATCCTTCGAACTGATGGTTCTTTTTCTAATTCGTGGATTGCTGGTGCCATATTTTATTT 
CTATTGCAACTGTATTTTAGGGTGTCTCTTTCTTTTTGATTTCTTGTTAATATTTGTGTTCAGGTTGTA ACTATGGGTTGCTAGGGTGTCTGCCCTCTTCTTTTGTGCTTCTTTCGCAGAATCTGTCCGTTGGTCTGT ATTTGGGTGATGAATTATTTATTCCTTGAAGTATCTGTCTAATTAGCTTGTGATGATGTGCAGGTATA TTCGTTAGTCATATTTCAATTTCAAGCGATCCCCCGGGCTGCAGAAGCTTATCCTTGGGCAGGGATAC GGCATGACAGAAGCAGGCCCGGTGCTGGCAATGAACCTAGCCTTCGCAAAGAATCCTTTCCCCGTCA AATCTGGCTCCTGCGGAACAGTCGTCCGGAACGCTCAAATAAAGATCCTCGATACAGAAACTGGCGA GTCTCTCCCGCACAATCAAGCCGGCGAAATCTGCATCCGCGGACCCGAAATAATGAAAGGATATATT AACGACCCGGAATCCACGGCCGCTACAATCGATGAAGAAGGCTGGCTCCACACAGGCGACGTCGGGT ACATTGACGATGACGAAGAAATCTTCATAGTCGACAGAGTAAAGGAGATTATCAATATAAAGGCTTC CAGGTGGATCCTGCTAATCAAGCTTCTGCAGGAATTCGTCCAGCAGTCTCGAGCAGGTCAGTAATCTT AACTTCCCTTTTGAAAACTCTTAAGAATGAAAATTTATCTTAAATTTAGAAACTTTGGCTGATCTTTC GAAAATCTGCTAAATTTTTTGGAACCTTGGCCGATCTTTTAAAAATATGCGAATTCTTTTAGCAATCT ACAAATCTTTTTAAAATATATAATTGAAAATCTGCTAAATTTGTTGGAACCTTGACTGTTCTTTTTAAA ATATGCAAATTCTTTTAGCAACTTGCAAATTCTTTAGCAATCTACAAATCTTTTTAAAACATATAAAT GAAAATGGACCAATTTTTCTAGCCCCTAAATTTTTTCTAGCCCCTTGCTTTTCCTTCCAAATACCCTAC CTAATTTTGCATCTAACAGGCCCAATCATTTAACCTTTTCAGGGCTCGAGAATCTGGAAGCTTATCGG AAGCTTGATTAGCAGGATCCACCTGGAAGCCTTTATATTGATAATCTCCTTTACTCTGTCGACTATGA AGATTTCTTCGTCATCGTCAATGTACCCGACGTCGCCTGTGTGGAGCCAGCCTTCTTCATCGATTGTA GCGGCCGTGGATTCCGGGTCGTTAATATATCCTTTCATTATTTCGGGTCCGCGGATGCAGATTTCGCC GGCTTGATTGTGCGGGAGAGACTCGCCAGTTTCTGTATCGAGGATCTTTATTTGAGCGTTCCGGACGA CTGTTCCGCAGGAGCCAGATTTGACGGGGAAAGGATTCTTTGCGAAGGCTAGGTTCATTGCCAGCAC CGGGCCTGCTTCTGTCATGCCGTATCCCTGCCCAAGGATAAGCTTCCGATGGGTGTTATTTGTGGATA ATAAATTCGGGTGATGTTCAGTGTTTGTCGTATTTCTCACGAATAAATTGTGTTTATGTATGTGTTAG TGTTGTTTGTCTGTTTCAGACCCTCTTATGTTATATTTTTCTTTTCGTCGGTCAGTTGAAGCCAATACT GGTGTCCTGGCCGGCACTGCAATACCATTTCGTTTAATATAAAGACTCTGTTATCCGTGAGCTCGAAT TTCCCCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGAT GATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGCATGACGTTAT TTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAATACGCGATAGAAAACAAAATA TAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATCGC 15 Intron 
Sequence PDK CTCGAGTTGGTAAGGAAATAATTATTTTCTTTTTTCCTTTTAGTATA4ATAGTTAAGTGATGTTAATT AGTATGATTATAATAATATAGTTGTTATAATTGTGAAAAAATAATTTATAAATATATTGTTTACATAA ACAACATAGTAATGTAAAAAAATATGACAAGTGATGTGTAAGACGAAGAAGATAAAAGTTGAGAGT AAGTATATTATTTTTAATGAATTTGATCGAACATGTAAGATGATATACTAGCATTAATATTTGTTTTA ATCATAATAGTAATTCTAGCTGGTTTGATGAATTAAATATCAATGATAAAATACTATAGTAAAAATAA GAATAAATAAATTAAAATAATATTTTTTTATGATTAATAGTTTATTATATAATTAAATATCTATACCAT TACTAAATATTTTAGTTTAAAAGTTAATAAATATTTTGTTAGAAATTCCAATCTGCTTGTAATTTATCA ATAAACAAAATATTAAATAACAAGCTAAAGTAACAAATAATATCAAACTAATAGAAACAGTAATCTA ATGTAACAAAACATAATCTAATGCTAATATAACAAAGCGCAAGATCTATCATTTTATATAGTATTATT TTCAATCAACATTCTTATTAATTTCTAAATAATACTTGTAGTTTTATTAACTTCTAAATGGATTGACTA TTAATTAAATGAATTAGTCGAACATGAATAAACAAGGTAACATGATAGATCATGTCATTGTGTTATCA TTGATCTTACATTTGGATTGATTACAGTTGCTCGAG 16 Primers to amplify PDK CTCGAGTTGGTAAGGAAATAATTATTTTCTTTTTT intron oARB633 17 Primers to amplify PDK CTCGAGCAACTGTAATCAATCCAAATGTAAGATC intron oARB634 18 Pine4CL ATTCAATTCTTCCCACTGCAGGCTACATTTGTCAGACACGTTTTCCGCCATTTTTCGCCTGTTTCTGCG Frag-A 1-334 334nuc) GAGAATTTGATCAGGTTCGGATTGGGATTGAATCAATTGAAAGGTTTTTATTTTCAGTATTTCGATCG CCATGGCCAACGGAATCAAGAAGGTCGAGCATCTGTACAGATCGAAGCTTCCCGATATCGAGATCTC CGACCATCTGCCTCTTCATTCGTATTGCTTTGAGAGAGTAGCGGAATTCGCAGACAGACCCTGTCTGA TCGATGGGGCGACAGACAGAACTTATTGCTTTTCAGAGGTGGAACTGATTTCTCGCAAGGTC 19 Pine4CL GCTGCCGGTCTGGCGAAGCTCGGGTTGCAGCAGGGGCAGGTTGTCATGCTTCTCCTTCCGAATTGCAT Frag-B 335-668 (334nuc) CGAATTTGCGTTTGTGTTCATGGGGGCCTCTGTCCGGGGCGCCATTGTGACCACGGCCAATCCTTTCT ACAAGCCGGGCGAGATCGCCAAACAGGCGAAGGCCGCGGGCGCGCGCATCATAGTTACCCTGGCAGC TTATGTTGAGAAACTGGCCGATCTGCAGAGCCACGATGTGCTCGTCATCACAATCGATGATGCTCGCA AGGAAGGTTGCCAACATATTTCCGTTCTGACCGAAGCCGACGAAACCCAATGcCCGGCCGTGA 20 Pine4CL CAATCCACCCGGACGATGTCGTGGCGTTGCCCTATTCTTCCGGAACCACGGGGCTCCCCAAGGGCGTGATG Frag-C 669-1002 (334nuc) TTAACGCACAAAGGCCTGGTGTCCAGCGTTGCCCAGCAGGTCGATGGTGAAAATCCCAATCTGTATTTCCAT TCCGATGACGTGATACTCTGTGTCTTGCCTCTTTTCCACATCTATTCTCTCAATTCGGTTCTCCTCTGCGCGCT 
CAGAGCCGGGGCTGCGACCCTGATTATGCAGAAATTCAACCTCACGACCTGTCTGGAGCTGATTCAGAAATA CAAGGTTACCGTTGCCCCAATTGTGCCTCCAATTGTCCTGGACAT 21 Pine4CL Frag-D CACAAAGAGCCCCATCGTTTCCCAGTACGATGTCTCGTCCGTCCGGATAATCATGTCCGGCGCTGCGC 1003-1336 (334nuc) CTCTCGGGAAGGAACTGGAAGATGCCCTCAGAGAGCGTTTTCCCAAGGCCATTTTCGGGCAGGGCTA CGGCATGACAGAAGCAGGCCCGGTGCTGGCAATGAACCTAGCCTTCGCAAAGAATCCTTTCCCCGTC AAATCTGGCTCCTGCGGAACAGTCGTCCGGAACGCTCAAATAAAGATCCTCGATACAGAAACTGGCG AGTCTCTCCCGCACAATCAAGCCGGCGAAATCTGCATCCGCGGACCCGAAATAATGAAAGGATAT 22 Pine4CL Frag-E ATTAACGACCCGGAATCCACGGCCGCTACAATCGATGAAGAAGGCTGGCTCCACACAGGCGACGTCG 1337-1670 (334nuc) GGTACATTGACGATGACGAAGAAATCTTCATAGTCGACAGAGTAAAGGAGATTATCAAATATAAGGG CTTCCAGGTGGCTCCTGCTGAGCTGGAAGCTTTACTTGTTGCTCATCCGTCAATCGCTGACGCAGCAG TCGTTCCTCAAAAGCACGAGGAGGCGGGCGAGGTTCCGGTGGCGTTCGTGGTGAAGTCGTCGGAAAT CAGCGAGCAGGAAATCAAGGAATTCGTGGCAAAGCAGGTGATTTTCTACAAGAAAATACACAGAG 23 Pine4CL Frag-F TTTACTTTGTGGATGCGATTCCTAAGTCGCCGTCCGGCAAGATTCTGAGAAAGGATTTGAGAAGCAG 1671-1997 (327nuc) ACTGGCAGCAAAATGAAAATGAATTTCCATATGATTCTAAGATTCCTTTGCCGATAATTATAGGATTC CTTTCTGTTCACTTCTATTTATATAATAAAGTGGTGCAGAGTAAGCGCCCTATAAGGAGAGAGAGAGC TTATCAATTGTATCATATGGATTGTCAACGCCCTACACTCTTGCGATCGCTTTCAATATGCATATTACT ATAAACGATATATGTTTTTTTTATAAATTTACTGCACTTCTCGTTCAAAAAAAAA 24 Pine4CL Frag-G CCTTCGCAAAGAATCCTTTCCCCGTCAAATCTGGCTCCTGCGGAACAGTCGTCCGGAACGCTCAAATA 1121-1493 (373nuc) AAGATCCTCGATACAGAAACTGGCGAGTCTCTCCCGCACAATCAAGCCGGCGAAATCTGCATCCGCG GACCCGAAATAATGAAAGGATATATTAACGACCCGGAATCCACGGCCGCTACAATCGATGAAGAAGG CTGGCTCCACACAGGCGACGTCGGGTACATTGACGATGACGAAGAAATCTTCATAGTCGACAGAGTA AAGGAGATTATCAAATATAAGGGCTTCCAGGTGGCTCCTGCTGAGC see Pine 4CL Frag-H 48 25 Primers to amplify AATCGATACTGCAGGCGCCACCACCAAACGCTCA e. gradis 4CL clone 26 Primers to amplify AATCGATACTGCAGACTCGGAGATGTTCTCGAAG e. 
gradis 4CL clone 27 Euc 4CL gcgccaccaccaaacgctcaccttctcatcatcagccctctgtctctgtctctgtctctcgattctccgccccgccacgacaatggaggcgaagccgtcggagcagccc 200 bp fragment (1-200) cgcgagttcatcttccggtcgaagctccccgacatctacattcccgacaacctctccctccacgcctactgcttcgagaacatctccgagt 28 Euc 4CL Tcgccgaccgcccctgcgtcatcaacggggccaccggccggacctacacctatgccgaggtcgagctgatctcccgccgggtctcagccggcctcaacgggctcg 223 bp fragment gcgtcggacagggcgacgtgatcatgctgctcctccagaactgccctgagttcgtgttcgcgttcctcggcgcgtcctaccggggcgccatcagcacgaccgcgaac (201-423) ccgttctacac 29 Euc 4CL gcgccggagggctgcctgcacttctcggaattgatgcaggcggacgagaacgccgcccccgcggcggacgtcaagccggacgacgtcttggcgctcccctattcgt 300 bp fragment cgggcacgacggggcttcccaagggagtgatgcttacgcacaggggtcaagtgaccagcgtggcgcagcaggtcgacggagacaaccccaacttgtacttccaca (551-850) aggaggacgtgatcctgtgcacgctcccgttgttccacatatactccctcaactcggtgatgttctgcgcgctccgtgtcggcgccgcc 30 Euc 4CL gagctcgaggacaccgtgcgagccaagctgcccaatgccaagctcggacagggctatgggatgacggaggcgggcccggtgctggcaatgtgcccggcatttgca 336 bp fragment aaggagccgttcgagatcaagtcaggcgcatgcgggaccgtcgtgaggaacgcggagatgaagatcgtcgacccggagacaggggcctcgctcccgcggaacca (1031-1378) ggccggcgagatctgcatccggggtcaccagatcatgaaaggttatctgaacgacgccgaggcgaccgcaaataccatagacaaagaagggtggctgcacaccgg cgacatcggctacatagacgatgacgacgagctc 31 Euc 4CL ttcctgttgcattcgtggtgaaatccaatggttccgtaatcaccgaggacgaaatcaagcaatacatctcgaagcaggtcgtgttttacaagaggatcaagcgggttttcttc 500 bp fragment acggacgcaattccgaaagccccctccggaaaaatcttgaggaaggacctaagagcaaagttggcctctggtgtttacaattaatttctcatacccttttctttttcaaccct (1521-2020) gcccctgtacttgcttaaagacccatgtagttgaaatgaatgtaacctcttcggaggggccaaatatggaagggggaaagaaagacatatggcgatgatttgatttcacat gctattgtaatgtatttattgtttcaattccgaattagacaaagtgcttaaagctctcttttcggattttttttttcattaatgtataataattgcggacattacaatatactgtacaac gtgatttgagcttgatgaattacaagattggaagaacttcgaa 32 Complete RNAi casette GGCCGCAAAACCCCTCACAAATACATAAAAAAAATTCTTTATTTAATTATCAAACTCTCCACTACCTT for pARB583 TCCCACCAACCGTTACAATCCTGAATGTTGGAAAAAACTAACTACATTGATATAAAAAAACTACATTA 
CTTCCTAAATCATATCAAAATTGTATAAATATATCCACTCAAAGGAGTCTAGAAGATCCACTTGGACA AATTGCCCATAGTTGGAAAGATGTTCACCAAGTCAACAAGATTTATCAATGGAAAAATCCATCTACC AAACTTACTTTCAAGAAAATCCAAGGATTATAGAGTAAAAAATCTATGTATTATTAAGTCAAAAAGA AAACCAAAGTGAACAAATATTGATGTACAAGTTTGAGAGGATAAGACATTGGAATCGTCTAACCAGG AGGCGGAGGAATTCCCTAGACAGTTAAAAGTGGCCGGAATCCCGGTAAAAAAGATTAAAATTTTTTT GTAGAGGGAGTGCTTGAATCATGTTTTTTATGATGGAAATAGATTCAGCACCATCAAAAACATTCAG GACACCTAAAATTTTGAAGTTTAACAAAAATAACTTGGATCTACAAAAATCCGTATCGGATTTTCTCT AAATATAACTAGAATTTTCATAACTTTCAAAGCAACTCCTCCCCTAACCGTAAAACTTTTCCTACTTCA CCGTTAATTACATTCCTTAAGAGTAGATAAAGAAATAAAGTAAATAAAAGTATTCACAAACCAACAA TTTATTTCTTTTATTTACTTAAAAAACAAAAGTTTATTTATTTTACTTAAATGGCATAATGACATAT CGGAGATCCCTCGAACGAGAATCTTTTATCTCCCTGGTTTTGTATTAAAAAGTAATTTATTGTGGGGT CCACGCGGAGTTGGAATCCTACAGACGCGCTTTACATACGTCTCGAGAAGCGTGACGGATGTGCGAC CGGATGACCCTGTATAACCCACCGACACAGCCAGCGCACAGTATACACGTGTCATTTCTCTATTGGAA AATGTCGTTGTTATCCCCGCTGGTACGCAACCACCGATGGTGACAGGTCGTCTGTTGTCGTGTCGCGT AGCGGGAGAAGGGTCTCATCCAACGCTATTAAATACTCGCCTTCACCGCGTTACTTCTCATCTTTTCT CTTGCGTTGTATAATCAGTGCGATATTCTCAGAGAGCTTTTCATTCAAAGGTATGGAGTTTTGAAGGG CTTTACTCTTAACATTTGTTTTTCTTTGTAAATTGTTAATGGTGGTTTCTGTGGGGGAAGAATCTTTTG CCAGGTCCTTTTGGGTTTCGCATGTTTATTTGGGTTATTTTTCTCGACTATGGCTGACATTACTAGGGC TTTCGTGCTTTCATCTGTGTTTTCTTCCCTTAATAGGTCTGTCTCTCTGGAATATTTAATTTTCGTATGT AAGTTATGAGTAGTCGCTGTTTGTAATAGGCTCTTGTCTGTAAAGGTTTCAGCAGGTGTTTGCGTTTT ATTGCGTCATGTGTTTCAGAAGGCCTTTGCAGATTATTGCGTTGTACTTTAATATTTTGTCTCCAACCT TGTTATAGTTTCCCTCCTTTGATCTCACAGGAACCCTTTCTTCTTTGAGCATTTTCTTGTGGCGTTCTG TAGTAATATTTTAATTTTGGGCCCGGGTTCTGAGGGTAGGTGATTATTCACAGTGATGTGCTTTCCCT ATAAGGTCCTCTATGTGTAAGCTGTTAGGGTTTGTGCGTTACTATTGACATGTCACATGTCACATATT TTCTTCCTCTTATCCTTCGAACTGATGGTTCTTTTTCTAATTCGTGGATTGCTGGTGCCATATTTTATTT CTATTGCAACTGTATTTTAGGGTGTCTCTTTCTTTTTGATTTCTTGTTAATATTTGTGTTCAGGTTGTA ACTATGGGTTGCTAGGGTGTCTGCCCTCTTCTTTTGTGCTTCTTTCGCAGAATCTGTCCGTTGGTCTGT ATTTGGGTGATGAATTATTTATTCCTTGAAGTATCTGTCTAATTAGCTTGTGATGATGTGCAGGTATA 
TTCGTTAGTCATATTTCAATTTCAAGCGATCCCCCGGGCTGCAGGCGCCACCACCAAACGCTCACCTT CTCATCATCAGCCCTCTGTCTCTGTCTCTGTCTCTCGATTCTCCGCCCCGCCACGACAATGGAGGCGA AGCCGTCGGAGCAGCCCCGCGAGTTCATCTTCCGGTCGAAGCTCCCCGACATCTACATTCCCGACAAC CTCTCCCTCCACGCCTACTGCTTCGAGAACATCTCCGAGTCTGCAGGAATTCGTCCAGCAGTAATTCG ATTCTCGAGTTGGTAAGGAAATAATTATTTTCTTTTTTCCTTTTAGTATAAAAGTTAAGTGATGTTA ATTAGTATGATTATAATAATATAGTTGTTATAATTGTGAAAAAATAATTTATAAATATATTGTTTACA TAAACAACATAGTAATGTAAAAAAATATGACAAGTGATGTGTAAGACGAAGAAGATAAAAGTTGAG AGTAAGTATATTATTTTTAATGAATTTGATCGAACATGTAAGATGATATACTAGCATTAATATTTGTT TTAATCATAATAGTAATTCTAGCTGGTTTGATGAATTAAATATCAATGATAAAATACTATAGTAAAAA TAAGAATAAATAAATTAAAATAATATTTTTTTATGATTAATAGTTTATTATATAATTAAATATCTATA CCATTACTAAATATTTTAGTTTAAAAGTTAATAAATATTTTGTTAGAAATTCCAATCTGCTTGTAATTT ATCAATAAACAAAATATTAAATAACAAGCTAAAGTAACAAATAATATCAAACTAATAGAAACAGTAA TCTAATGTAACAAAACATAATCTAATGCTAATATAACMkAGCGCAAGATCTATCATTTTATATAGTAT TATTTTCAATCAACATTCTTATTAATTTCTAAATAATACTTGTAGTTTTATTAACTTCTAAATGGATTG ACTATTAATTAAATGAATTAGTCGAACATGAATAAACAAGGTAACATGATAGATCATGTCATTGTGTT ATCATTGATCTTACATTTGGATTGATTACAGTTGCTCGAGAATCACTAGTGAATTAAATCTGGAAGCT TATCGATACTGCAGACTCGGAGATGTTCTCGAAGCAGTAGGCGTGGAGGGAGAGGTTGTCGGGAATG TAGATGTCGGGGAGCTTCGACCGGAAGATGAACTCGCGGGGCTGCTCCGACGGCTTCGCCTCCATTG TCGTGGCGGGGCGGAGAATCGAGAGACAGAGACAGAGACAGAGGGCTGATGATGAGAAGGTGAGCG TTTGGTGGTGGCGCCTGCAGTATCGATGGGTGTTATTTGTGGATAATAAATTCGGGTGATGTTCAGTG TTTGTCGTATTTCTCACGAATAAATTGTGTTTATGTATGTGTTAGTGTTGTTTGTCTGTTTCAGACCCT CTTATGTTATATTTTTCTTTTCGTCGGTCAGTTGAAGCCAATACTGGTGTCCTGGCCGGCACTGCAATA CCATTTCGTTTAATATAAAGACTCTGTTATCCGTGAGCTCGAATTTCCCCGATCGTTCAAACATTTGG CAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGATGATTATCATATAATTTCTGTTGAA TTACGTTAAGCATGTAATAATTAACATGTAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTA GAGTCCCGCAATTATACATTTAATACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATT ATCGCGCGCGGTGTCATCTATGTTACTAGATCGC 33 Euc 4CL 200 bp atttgatttcacatgctattgtaatgtatttattgtttcaattccgaattagacaaagtgcttaaagctctcttttcggattttttttttcattaatgtataataattgcggacattaca fragment (1844-2043) 
atatactgtacaacgtgatttgagcttgatgaattacaagattggaagaacttcgaagacaaaaaaaaaaaaaaaaaaaa 34 Euc 4CL 600 bp gcgccaccaccaaacgctcaccttctcatcatcagccctctgtctctgtctctgtctctcgattctccgccccgccacgacaatggaggcgaagccgtcggagcagccc fragment (1-600) cgcgagttcatcttccggtcgaagctccccgacatctacattcccgacaacctctccctccacgcctactgcttcgagaacatctccgagttcgccgaccgcccctgcgt catcaacggggccaccggccggacctacacctatgccgaggtcgagctgatctcccgccgggtctcagccggcctcaacgggctcggcgtcggacagggcgacgt gatcatgctgctcctccagaactgccctgagttcgtgttcgcgttcctcggcgcgtcctaccggggcgccatcagcacgaccgcgaacccgttctacaccccgggcga gatcgccaagcaggcctcagctgcccgggccaagatcgtgatcacgcaggccgcgttcgccgacaaggtgaggccgttcgcggaggagaacggggtgaaggtcg tgtgcatcgataccgcgccggagggctgcctgcacttctcggaattgatgcaggcggacgagaa 35 Euc Arabinogalactan AAATACATGCCAGTGTGGAATAACTATGCGAAGTTATCATTTGGTGCACTTGCTTGGGTGAACTTGAT Promoter GCCTTACTGAAGTTTTATTTTTGACCATCTTTGTTGTGATTTAACATATTTGAGCGCTACCGTACTTAT GACACTTAAATGATGAAAGTTGCTGTAGGGTGAATTTGGCTGTTTGACGCATGGAGATTAGGCATTA ACCTTTCTTAGTTATGCTGATTATTTCTTGTGTGTCTTTTTTTCCCCCTCCTTCAGCATCACTTGTTTGC AAGTGGAAGAGATATGACTTTCTTTCAGGTACTTGTTTTCATACCCATATTAATACATCTGGTTAAAT CATGAAATTTTTGTATTGATCGTTTGTATGTCCAATGACAGTATGACCTATTCAATGACATTTGGTTGT GTGCTAGATTTCGTTCCAGAGAAAATGAAAGCAGAAGATGCATTGGCAGAGAGGAAACCAGAAGAG ACATGAATATGATACTAATCTTAGGTCAAGAAGCTGTAACTTTCATTGATTGAGGGGCTTCAATTTGT ATGAGCATCTTATACTGTGATTTGGTTCTTTTCCTGCTATAGCAGAATAGAGCCAGCAAAATGGGCAC TTACATTTAGCTGCAGATGATGTCTGTATGGGCGAATTTTTTCGCATGTTACATTGGAGAAGAGAAAT GCTTATACTTCTGGTAATTTTTTCAGCAAATAGTCTCATGCCCTGCTAACATGGATGGTGGGATAGCT TCTTCTGGGGAGTGTAATTAATCTGTCATGGACAAGTACTTTGTAGTTAATCTGATTCTCGGCCTATG TTATATCTGTTTTGCGTTATACTAAAGATATTCAGATCAATCTATGTCAATCTATTCACGAAAACCCG GGGAGTCTAATGAGGAGAGTTGCATCTTGGCAATATAGTTTTTAAGAATGGATATCCAGATCCCTAC GAACTGGATTCACACAGTCACTGCTGTAAGCTCTGGTTTTTTTTAGCTTAGGAAGCAGGTTATAATCA AAGATGATTAAACCATCGCGTGTTCGCCAGCCATCAGAAATGGAAAGGCAAATGTTGTTATAGTGAT GGAGAGATCATGCTGAGATGATTGATTATGAATCTTACTGATGACTGTCATTTATGTTATCGCACTCT GTGTGTGTGGGTGTGTGTAATGAGTAATATCAAATTAACCAGACGATAGGTGTTGAAGATTAGCTGT 
TGGGCCGCCGTGGCAAAAGGTGTCTTATACAAGCCATCGGCAGTGACGCAGAACTGTAGAGAACCGC TGTAACAAGTCTTCGAATGCATTCTTTTAATGTACAGCACGACATGAAGGGGGTTCAAGTGTAGCGA ACAGTTCGTGCGAGAAAGATCATTTTCAATAGCATAAAAGAGTCTGCTCTCTGCTGCAAACATGGAA AGAACTTACATTTCAATCATTGAGGAGAAGATTATAACAAATCCTAAATGGTTGGGATTTTAGTTAGT CCATTCGAACTAAAGTGGCGAAGATGTCAGTTTTTCAAGTGGATGATATTTCTCATGTATGTTCCGCA GAGGCAATCACCTTGTTTGTAACTAGACATCTAGAGAACCTAACAAGGATTGATGGGGGTGAGGTGA AATGTCTGTTTCCTCTTTAATATGGATCCAGCGATGCCTTACAGAGCGGATGGATGGCACTGGCAAGT CTTAATCCTTAGGTCGAATGTTTGATTGGTAACAGATGCCFFTTCTTTCTTTTCAATCACAGCTGACAA ATGCAAATATCTAAAACCATTGGCTGTTTGGTGCTTGCAAGTCTGGATTACCCCACTTTATGTTTCAC CTTTCAATAATGAATAACAAGGTACTCGGGAAAAAAAGGAAAGGGAAATTCGCACAACCAAAGTTGC TATGCAGAAGTCAACTCAATCCTAATCAAGTTGATGAGAGTGTTGGGCCCTATTTTCTGCAGCAAACA TGAATCTCGATTCATCTCCCTCGCAAAAGATAAGGAAGCTGCAAAAGCTTTCCTCCTAAGTTTGTTGG CAGGCAAATTGATTTTGTACCAGAAATAAATACAAAGTGAAACCCAAGCAATCACGCATGGCCTGAT TTGTGCCATGTCCATTTGATCTCCCTCTACCATTTTTCCTGCTTTCTCAAGCAAACTAGTTGCTGTAAC AGTGAATGATCCCCCGGCTCTCTCTCTCTCTCTCTCTCTCTCCATTTATTCCATCCATGTTTTTGCTTTT CGCACAACACTTATCATTGAGGTGCTAACTACTGAATTCCCCTAACTAAAAATTGGAACCTCTCACCT AATTTCATTTTCTCCCACTTTGATGAGCACCACTCTCTTTCCCAGATTTCAAATAAATTGCCACTCTCT CCCTCCTCTTTCCTCACACAACCAAAAGCCTTCTTCAAGTACCACTTCTTCACTGTCC 36 ColE1-F4 (primer to GAGAGAGGATCCGGTGTGAAATACCGCACAG ColE1 replication) 37 ColE1-R4 (primer to GAGAGATGATCAGCCTCACTGATTAAGCATTGGTAACTG ColE1 replication) 38 Pr LIM FragA 1 to 390 gtagatttaaatgcttttttgaaatccggttactcgcaagattatcaatcgggactgtagccgaagctttgagaggttgaaattcagacttttgctccgaactgttctgctgaaa caaaatccagtattgagctaggtttagaatcgggtttgctggtcatctgggagaggcgatccattcagcttcgcaggcccccgaagatggcgttcgccggcacaaccca gaagtgcaaggcatgtgaaaagacggtctatttggttgatcaattgacagctgataattctgtttttcacaaatcctgtttccgctgccatcactgcaatggaactttaaagctt agcaactattcgtcgtttgagggagttctatattgcaaacctcattttgac 39 Pr LIM FragB 391 to 780 cagctgtttaagagaacaggaagtttggataaaagttttgaagccattcctagagcatcaagaaatgacaagatgcatgagaatgagaacaggacacctagtagggtat 
cagcattgttttccggtacacaggataaatgtgttgcatgtgggaagacagtgtaccccattgagaaggttgctgttgatggtacatcataccaccgaccatgcttcaagtg ctgtcatggtggttgtgtcatcagcccctcaaattatgttgctcatgaaggcaggctatattgtaggcatcatagctctcaactttttagggagaaaggtaacttcagccagct ttcaaaggcaacacctacaaaaggggtgactgagaactcagacacagacgacaag 40 Euc LIM ggcttccctttcttatcctccattctcctctctccttctccttacactcacagacacaatcacagagagagagagagagagagagagagagagagagagagaatggcatt “164 bp frag” 1-164 cgcaggaacaacccagaagtgcatggcctgtgagaagacagtctatctggtgga (164nuc) 41 Euc LIM Ggcttccctttcttatcctccattctcctctctccttctccttacactcacagacacaatcacagagagagagagagagagagagagagagagagagagagaatggcatt “455 bp frag” 1-455 cgcaggaacaacccagaagtgcatggcctgtgagaagacagtctatctggtggacaagctcacagctgacaatagaatctaccacaaggcctgcttcagatgccacc (455nuc) attgcaaagggactctcaagcttgggaactataattcatttgaaggagtcttgtactgccggccgcatttcgatcagctcttcaagagaactggcagcctcgaaaaaagctt tgaaggaacccccaagattgcaaagccagagaaacccgtcgatggagagagacctgcagcgaccaaagcctccagtatgttcgggggaacgcgagacaaatgtgt aggctgtaagagcaccgtcta 42 Pine CCo-OMT fragA AGGTTTAAGGAAATGGCAGGCACAAGTGTTGCTGCAGCAGAGGTGAAGGCTCAGACAACCCAAGCA 20nuc-570nuc GAGGAGCCGGTTAAGGTTGTCCGCCATCAAGAAGTGGGACACAAAAGTCTTTTGCAGAGCGATGCCC TCTATCAGTATATATTGGAAACGAGCGTGTACCCTCGTGAGCCCGAGCCAATGAAGGAGCTCCGCGA AGTGACTGCCAAGCATCCCTGGAACCTCATGACTACTTCTGCCGATGAGGGTCAATTTCTGGGCCTCC TGCTGAAGCTCATTAACGCCAAGAACACCATGGAGATTGGGGTGTACACTGGTTACTCGCTTCTCAGC ACAGCCCTTGCATTGCCCGATGATGGAAAGATTCTAGCCATGGACATCAACAGAGAGAACTATGATA TCGGATTGCCTATTATTGAGAAAGCAGGAGTTGCCCACAAGATTGACTTCAGAGAGGGCCCTGCTCT GCCAGTTCTGGACGAACTGCTTAAGAATGAGGACATGCATGGATCGTTCGATTTTGTGTTCGTGGATG CGGACAAAGACAA 43 Pinus radiata CCoAOMT gaaggaatttggtaggcaactatgtatatcactatattatatgcattttctcgagatgtctaatctcatttgtgtcccacctccctggaccggctaatgatttgactatctttgtttta No.3 793-1016nuc aaggaagcaaacttggtgtaggattctctccaacttcaatgatgcaataagcaagaggataaatgtcattatctttcatggacggagcacaaatggctttttacac 44 Eucalyptus grandis tcgcaccagaaaggagatctcaaaatcaagcattgatgaaatgagaaactacccttaatactttccttcctttctattttttccatcttctgtcttatgttgtctttgaaccattgag 
CCoAOMT 745-904nuc catgtatttgtattcaaatgaacgattaaggattgagaagaac 45 Eucalyptus grandis caccccggtgaagcagtgcctgtacgaaactgtcaagagcttgcaggagaaaggccacctacccgtccctcccccgccggaagattcggtgcgtattcagggatgat CCR 1038-1326nuc cttagatccatcacggtgcgcatttgtaatccggagaaatgagagaaacatgtgggaatttgttgtacttttctaagtcaaacctggagataccaaccctgagttctgcatt ggaatggaagttgtcaattgatcaatcgtcgcaagttatcgttggcagaaacggaatgtcagttaccat 46 Eucalyptus grandis C3H GAAGCTTGGCGCATCGCTCGCCATGGCGGAGCACATCCCGTGGCTTCGCTGGATGTTCCCGCTGGAG 600 bp GAGGAAGCGTTCGCCAAGCACAGCGCGAGGAGGGACCGCCTCACCCGGGCCATCATGGAGGAGCAC ACGGTAGCCCGCCAGAAGAGCGGGGCCAAGCAGCATTTCGTCGACGCCCTGCTCACCCTCAAGGACA AATACGACCTCAGCGAAGATACCATCATAGGACTCCTCTGGGACATGATCACAGCAGGCATGGACAC TACTGCTATTTCAGTGGAGTGGGCGATGGCGGAGCTGATCAAGAACCCGAGGGTGCAACAGAAGGCC CAAGAGGAGCTCGACCGGGTCGTCGGGTTCGAGCGTGTGGTGACTGAGTCCGACTTCTCGAACCTCC CTTACCTCCAGTGCATTGCTAAGGAAGCGCTCCGGCTGCACCCTCCGACCCCGCTGATGCTCCCCCAC CGGTCCAACTCCCACGTCAAGATCGGCGGCTACGACATCCCCAAGGGGTCGAACGTCCACGTGAATG TATGGGCCATCGCCCGCGACCCGGCCGTCTGGAATAGCCCGCTCGAGTTCAGGCCCGAGCGGTTC 47 Eucalyptus grandis C4H CCCTGAGGCTCCGGATGGCGATCCCGCTCCTCGTGCCCCACATGAACCTCCACGACGCCAAGCTCGGG 600 bp GGCTACGACATCCCCGCCGAGAGCAAGATCCTGGTCAACGCGTGGTGGCTGGCCAACAACCCTGCCC ACTGGAAGAAGGCCGAGGAGTTCCGGCCCGAGCGGTTCCTGGAGGAGGAGGCGAAGGTCGAGGCCA ACGGGAACGACTTCCGGTACCTCCCCTTCGGAGTCGGCCGGAGGAGCTGCCCTGGGATCATCCTGGC CCTGCCCATCCTCGGGGTCACCATCGGCCAGTTGGTGCAGAACTTCGAGCTCTTGCCGCCCCCTGGAC AATCGAAGCTCGACACCACTGAGAAGGGTGGCCAATTCAGCTTGCACATATTGAAGCACTCCACCAT CGTCTTGAAGCCAAGATCCTTTTGAAGTTAGTCTCCACAGAGATTCAACTTTTGGTGGCTGTTGATTT CACTTGGACAGTATTAAAATATGAAGAATTGGACAAAGCATATTCAGGAGTTGCCATGAGAACTTAT GTTGTGTCTTGTGTTGGGAAAATAACAGCTTTTATGTCCTTTGAGAACTGAAACTTATCTTTTG 48 Pine 4CL Frag-H 1-668 ATTCAATTCTTCCCACTGCAGGCTACATTTGTCAGACACGTTTTCCGCCATTTTTCGCCTGTTTCTGCG GAGAATTTGATCAGGTTCGGATTGGGATTGAATCAATTGAAAGGTTTTTATTTTCAGTATTTCGATCG CCATGGCCAACGGAATCAAGAAGGTCGAGCATCTGTACAGATCGAAGCTTCCCGATATCGAGATCTC CGACCATCTGCCTCTTCATTCGTATTGCTTTGAGAGAGTAGCGGAATTCGCAGACAGACCCTGTCTGA 
TCGATGGGGCGACAGACAGAACTTATTGCTTTTCAGAGGTGGAACTGATTTCTCGCAAGGTCGCTGC CGGTCTGGCGAAGCTCGGGTTGCAGCAGGGGCAGGTTGTCATGCTTCTCCTTCCGAATTGCATCGAAT TTGCGTTTGTGTTCATGGGGGCCTCTGTCCGGGGCGCCATTGTGACCACGGCCAATCCTTTCTACAAG CCGGGCGAGATCGCCAAACAGGCCAAGGCCGCGGGCGCGCGCATCATAGTTACCCTGGCAGCTTATG TTGAGAAACTGGCCGATCTGCAGAGCCACGATGTGCTCGTCATCACAATCGATGATGCTCCCAAGGA AGGTTGCCAACATATTTCCGTTCTGACCGAAGCCGACGAAACCCAATGCCCGGCCGTGA 49 pARB310 cgccggcgttgtggatacctcgcggaaaacttggccctcactgacagatgaggggcggacgttgacacttgaggggccgactcacccggcgcggcgttgacagatg aggggcaggctcgatttcggccggcgacgtggagctggccagcctcgcaaatcggcgaaaacgcctgattttacgcgagtttcccacagatgatgtggacaagcctg gggataagtgccctgcggtattgacacttgaggggcgcgactactgacagatgaggggcgcgatccttgacacttgaggggcagagtgctgacagatgaggggcgc acctattgacatttgaggggctgtccacaggcagaaaatccagcatttgcaagggtttccgcccgtttttcggccaccgctaacctgtcttttaacctgcttttaaaccaatat ttataaaccttgtttttaaccagggctgcgccctgtgcgcgtgaccgcgcacgccgaaggggggtgcccccccttctcgaaccctcccggcccgctaacgcgggcctc ccatccccccaggggctgcgcccctcggccgcgaacggcctcaccccaaaaatggcagcgctggcagtccataattgtggtttcaaaatcggctccgtcgatactatg ttatacgccaactttgaaaacaactttgaaaaagctgttttctggtatttaaggttttagaatgcaaggaacagtgaattggagttcgtcttgttataattagcttcttggggtatc tttaaatactgtagaaaagaggaaggaaataataaatggctaaaatgagaatatcaccggaattgaaaaaactgatcgaaaaataccgctgcgtaaaagatacggaagg aatgtctcctgctaaggtatataagctggtgggagaaaatgaaaacctatatttaaaaatgacggacagccggtataaagggaccacctatgatgtggaacgggaaaag gacatgatgctatggctggaaggaaagctgcctgttccaaaggtcctgcactttgaacggcatgatggctggagcaatctgctcatgagtgaggccgatggcgtcctttg ctcggaagagtatgaagatgaacaaagccctgaaaagattatcgagctgtatgcggagtgcatcaggctctttcactccatcgacatatcggattgtccctatacgaatag cttagacagccgcttagccgaattggattacttactgaataacgatctggccgatgtggattgcgaaaactgggaagaagacactccatttaaagatccgcgcgagctgt atgattttttaaagacggaaaagcccgaagaggaacttgtcttttcccacggcgacctgggagacagcaacatctttgtgaaagatggcaaagtaagtggctttattgatct tgggagaagcggcagggcggacaagtggtatgacattgccttctgcgtccggtcgatcagggaggatatcggggaagaacagtatgtcgagctattttttgacttactg 
gggatcaagcctgattgggagaaaataaaatattatattttactggatgaattgttttagtacctagatgtggcgcaacgatgccggcgacaagcaggagcgcaccgactt cttccgcatcaagtgttttggctctcaggccgaggcccacggcaagtatttgggcaaggggtcgctggtattcgtgcagggcaagattcggaataccaagtacgagaag gacggccagacggtctacgggaccgacttcattgccgataaggtggattatctggacaccaaggcaccaggcgggtcaaatcaggaataagggcacattgccccgg cgtgagtcggggcaatcccgcaaggagggtgaatgaatcggacgtttgaccggaaggcatacaggcaagaactgatcgacgcggggttttccgccgaggatgccga aaccatcgcaagccgcaccgtcatgcgtgcgccccgcgaaaccttccagtccgtcggctcgatggtccagcaagctacggccaagatcgagcgcgacagcgtgcaa ctggctccccctgccctgcccgcgccatcggccgccgtggagcgttcgcgtcgtctcgaacaggaggcggcaggtttggcgaagtcgatgaccatcgacacgcgag gaactatgacgaccaagaagcgaaaaaccgccggcgaggacctggcaaaacaggtcagcgaggccaagcaggccgcgttgctgaaacacacgaagcagcagat caaggaaatgcagctttccttgttcgatattgcgccgtggccggacacgatgcgagcgatgccaaacgacacggcccgctctgccctgttcaccacgcgcaacaagaa aatcccgcgcgaggcgctgcaaaacaaggtcattttccacgtcaacaaggacgtgaagatcacctacaccggcgtcgagctgcgggccgacgatgacgaactggtgt ggcagcaggtgttggagtacgcgaagcgcacccctatcggcgagccgatcaccttcacgttctacgagctttgccaggacctgggctggtcgatcaatggccggtatt acacgaaggccgaggaatgcctgtcgcgcctacaggcgacggcgatgggcttcacgtccgaccgcgttgggcacctggaatcggtgtcgctgctgcaccgcttccg cgtcctggaccgtggcaagaaaacgtcccgttgccaggtcctgatcgacgaggaaatcgtcgtgctgtttgctggcgaccactacacgaaattcatatgggagaagtac cgcaagctgtcgccgacggcccgacggatgttcgactatttcagctcgcaccgggagccgtacccgctcaagctggaaaccttccgcctcatgtgcggatcggattcc acccgcgtgaagaagtggcgcgagcaggtcggcgaagcctgcgaagagttgcgaggcagcggcctggtggaacacgcctgggtcaatgatgacctggtgcattgc aaacgctagggccttgtggggtcagttccggctgggggttcagcagccagcgctttactggcatttcaggaacaagcgggcactgctcgacgcacttgcttcgctcagt atcgctcgggacgcacggcgcgctctacgaactgccgatagacaactgtcacggttaagcgagaaatgaataagaaggctgataattcggatctctgcgagggagat gatatttgatcacaggcagcaacgctctgtcatcgttacaatcaacatgctaccctccgcgagatcatccgtgtttcaaacccggcagcttagttgccgttcttccgaatag catcggtaacatgagcaaagtctgccgccttacaacggctctcccgctgacgccgtcccggactgatgggctgcctgtatcgagtggtgattttgtgccgagctgccggt 
cggggagctgttggctggctggtggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaataacacaccgcggtctagaactagtggatcccccctacgt gcgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgttttctatcgcgtattaaaatgtataattgcgggactctaatcataaaaaccc atctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaattatatgataatcatcgcaagaccggcaacaggattcaatcttaagaaact ttattgccaaatgtttgaacgatccctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcgataccgtaaagcagaggaagcg gtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtccgccacacccagccggccacagtcgatgaatccagaaaa gcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgagatcctcgccgtcgggcatgcgcgccttgagcctggcgaacagttcggctgg cgcgagcccctgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgctcgctcgatgcgatgtttcgcttggtggtcgaatgggcaggt agccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcaggagcaaggtgagatgacaggagatcctccccggcacttcgccca atagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtggccagccacgatagccgcgctgcctcgtcctggagttca ttcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaacacggcggcatcagagcagccgattgtctgttgtgcccagt catagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatagtactagttggggatctgcatctgaataaaacaatagaaca agtagaaaccaatcagcgaacatataccaaatcaaaagccgtaagagaaatcaaaacaacaccaaagagaaacggatctaaacataagaaacctaaaacagagaga atcgaacaaagaaaacacaaaaattgaatagatcgtccttgaaaatcctaatttcacaatcaagcaagaaattacacagatgtaaacactacgaatcgatatcttagtaatc aggacaaaatttagaagctggattgacgaaacgaacaatattgtcaaaagcaatttatacaaaagattcaataatccacataacaaaaattggagatcagatacgaatcaa aaacaaaaagaatcagaaaatataccttgaaagagagagtcgcgagagatttgcagagatcgctttaggctttgggagagattgaagagtcagaaaaagacgaaagg atgaattattatcttccacacgaaggtcttctttatatcgcaaaccaaaagcccaaaaccgtcttttctattaatgagaataaaatatctttagccaaaacaaaaaaaggaaga tatcagttgaggattattatcacgaaactaaaggaaggaatcatatgatacgtgtctattttccaccgtgcgtttttaaaagaccgactcaagtagaaacatcctatggtggtg 
gttggattaggtcatccattacatctgcttcactgacatttttctatttttctttttgtatatacttttcctcaaataatttctttcttttctatagaagaatttaatcaataaggaaaa agttcaaaaaagattctttccattaagactatgtcttggttaacccaacccattaagaataagcaatcataatatatatagagaatactaatactatatatgagatttttcttttaattt catgttgattatgatagtttatcttcttgatttaatttatcaatacttggcataaaagattctaatctactctaataaagaaaagaaaaaaaagtatctaccattgactaattaaaataa ggaaacttatctaccaaatttgagtattttttagaacaatctttttggtttaattccaaaactctaaacctaattgttgggaaaaaggacctaatttttaagaaaagttaataattaga agatctgtatgtttttttttttgatccaagtttttatttcttttctctttttttcatgataaaatctatgtttttttagtctacaattaaagtaattgttattattttctttatcttttt ttgttgttgttgttaattcccttttttttttttttaacagcaacttcttaaaaaaaaaaacagttgggccttgaatttatttcaggcctgcgttattaagcccagataataactcaaaac aaaaaaaatgttgaaccggaataaacccgcgagattaaatgccggttttcaggtaacatagaagaagaatatatgaggattgaagaagtattcaagaggcggaacaattcacaagtccaa gagcttaaatttctcctcactcttctgctacagactcggaactctttctctttgctaaaataagatgttcaggatttttgttgccgacaattcatgtatctcacactctctctcttct ctgttcttactactctgttacattaccaccaactcaagactttcttccacaatggcgtttatgagacttggctccaaatccgaagcttatcgataccgtcgacctctagaggcg cgccaagcggccgcatttaaatgggccctcgagagcccgggctcctgcaggtaccttaattaaaagtttaaactatcagtgtttgacaggatatattggcgggtaaaccta agagaaaagagcgtttattagaataatcggatatttaaaagggcgtgaaaggtttatccgttcgtccatttgtatgtgcatgccaaccacagggttcccagatc 50 primer STAR5BST GAGAGACCATAATTGTGGTCCAATTTGCAGCCGTCCGAG 51 primer STAR3BST GAGAGACCATAATTGTGGTTTGTGTTTCCATATTGTTCATC 52 UBQ10::partial NPTII ggcgcgccgtcaacggatcaggatatccttgtttaagatgttgaactctatggaggtttgtatgaactgatgatctaggaccggataagttcccttcttcatagcgaacttatt fragment caaagaatgttttgtgtatcattcttgttacattgttattaatgaaaaaatattattggtcattggactgaacacgagtgttaaatatggaccaggccccaaataagatccattga tatatgaattaaataacaagaataaatcgagtcaccaaaccacttgccttttttaacgagacttgttcaccaacttgatacaaaagtcattatcctatgcaaatcaataatcata caaaaatatccaataacactaaaaaattaaaagaaatggataatttcacaatatgttatacgataaagaagttacttttccaagaaattcactgattttataagcccacttgcat 
tagataaatggcaaaaaaaaacaaaaaggaaaagaaataaagcacgaagaattctagaaaatacgaaatacgcttcaatgcagtgggacccacggttcaattattgcc aattttcagctccaccgtatatttaaaaaataaaacgataatgctaaaaaaatataaatcgtaacgatcgttaaatctcaacggctggatcttatgacgaccgttagaaattgt ggttgtcgacgagtcagtaataaacggcgtcaaagtggttgcagccggcacacacgagtcgtgtttatcaactcaaagcacaaatacttttcctcaacctaaaaataagg caattagccaaaaacaactttgcgtgtaaacaacgctcaatacacgtgtcattttattattagctattgcttcaccgccttagctttctcgtgacctagtcgtcctcgtcttttcttc ttcttcttctataaaacaatacccaaagagctcttcttcttcacaattcagatttcaatttctcaaaatcttaaaaactttctctcaattctctctaccgtgatcaaggtaaatttctgt gttccttattctctcaaaatcttcgattttgttttcgttcgatcccaatttcgtatatgttctttggtttagattctgttaatcttagatcgaagacgattttctgggtttgatcgttag atatcatcttaattctcgattagggtttcataaatatcatccgatttgttcaaataatttgagttttgtcgaataattactcttcgatttgtgatttctatctagatctggtgttagttt ctagtttgtgcgatcgaatttgtcgattaatctgagtttttctgattaacagatgattgaacaagatggattgcacgcaggttctccggccgcttgggtggagaggctattcggctatg actgggcacaacagacaatcggctgctctgatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtcaagaccgacctgtccggtgccctgaatga actccaggacgaggcagcgcggctatcgtggctggccacgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaagggactggctgctattgg gcgaagtgccggggcaggatctcctgtcatctcaccttgctcctgccgagaaagtatccatcatggctgatgcaatgcggcggctgcatacgcttgatccggctacctgc ccattcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcggatggaagcgatcaggatgatctggacgaagagcatcaggggctcgcgccagccgaac tgttcgccaggctcaaggcgcgcatgcccgacggcgaggatctcgtcgtgacccatgg 53 primer UBQ10ASC GAGAGGCGCGCCGTCAACGGATCAGGATATCCTTGTTTAAGA 54 primer UBQ10P3 TGCTGGCAATCCATCTTGTTCAATCATCTGTTAATCAGAAAAACTCAGATTA 55 primer NPT2-5A TAATCTGAGTTTTTCTGATTAACAGATGATTGAACAAGATGGATTGCACGCA 56 primer NPT2-3A TATTGCCAAATGTTTGAACGATCCCTCAGAAGAACTCGTCAAGAAGGCGATA 57 primer NOSTER5A TATCGCCTTCTTGACGAGTTCTTCTGAGGGATCGTTCAAACATTTGGCAATA 58 primer NSTR3DRA GAGACACTACGTGCGATCTAGTAACATAGATGACAC 59 pARB1001 cgccggcgttgtggatacctcgcggaaaacttggccctcactgacagatgaggggcggacgttgacacttgaggggccgactcacccggcgcggcgttgacagatg 
aggggcaggctcgatttcggccggcgacgtggagctggccagcctcgcaaatcggcgaaaacgcctgattttacgcgagtttcccacagatgatgtggacaagcctg gggataagtgccctgcggtattgacacttgaggggcgcgactactgacagatgaggggcgcgatccttgacacttgaggggcagagtgctgacagatgaggggcgc acctattgacatttgaggggctgtccacaggcagaaaatccagcatttgcaagggtttccgcccgtttttcggccaccgctaacctgtcttttaacctgcttttaaaccaatat ttataaaccttgtttttaaccagggctgcgccctgtgcgcgtgaccgcgcacgccgaaggggggtgcccccccttctcgaaccctcccggcccgcgtaacgcgggcctc ccatccccccaggggctgcgcccctcggccgcgaacggcctcaccccaaaaatggcagcgctggcagtccataattgtggtccaatttgcagccgtccgagacagg aggacatcgtccagctgaaaccggggcagaatccggccatttctgaagagaaaaatggtaaactgatagaataaaatcataagaaggagccgcacatgaaaaaagc agtcattaacggggaacaaatcagaagtatcagcgacctccaccagacattgaaaaaggagcttgcccttccggaatactacggtgaaaacctggacgctttatgggat tgtctgaccggatgggtggagtacccgctcgttttggaatggaggcagtttgaacaaagcaagcagctgactgaaaatggcgccgagagtgtgcttcaggttttccgtga agcgaaagcggaaggctgcgacatcaccatcatactttcttaatacgatcaatgggagatgaacaatatggaaacacaaaccacaattgtggtttcaaaatcggctccgt cgatatactatgttatacgccaactttgaaaacaactttgaaaaagctgttttctggtatttaaggttttagaatgcaaggaacagtgaattggagttcgtcttgttataattagcttc ttggggtatctttaaatactgtagaaaagaggaaggaaataataaatggctaaaatgagaatatcaccggaattgaaaaaactgatcgaaaaataccgctgcgtaaaaga tacggaaggaatgtctcctgctaaggtatataagctggtgggagaaaatgaaaacctatatttaaaaatgacggacagccggtataaagggaccacctatgatgtggaa cgggaaaaggacatgatgctatggctggaaggaaagctgcctgttccaaaggtcctgcactttgaacggcatgatggctggagcaatctgctcatgagtgaggccgat ggcgtcctttgctcggaagagtatgaagatgaacaaagccctgaaaagattatcgagctgtatgcggagtgcatcaggctctttcactccatcgacatatcggattgtccc tatacgaatagcttagacagccgcttagccgaattggattacttactgaataacgatctggccgatgtggattgcgaaaactgggaagaagacactccatttaaagatccg cgcgagctgtatgattttttaaagacggaaaagcccgaagaggaacttgtcttttcccacggcgacctgggagacagcaacatctttgtgaaagatggcaaagtaagtgg cttattgatcttgggagaagcggcagggcggacaagtggtatgacattgccttctgcgtccggtcgatcagggaggatatcggggaagaacagtatgtcgagctattttt tgacttactggggatcaagcctgattgggagaaaataaaatattatattttactggatgaattgttttagtacctagatgtggcgcaacgatgccggcgacaagcaggagc 
gcaccgacttcttccgcatcaagtgttttggctctcaggccgaggcccacggcaagtatttgggcaaggggtcgctggtattcgtgcagggcaagattcggaataccaa gtacgagaaggacggccagacggtctacgggaccgacttcattgccgataaggtggattatctggacaccaaggcaccaggcgggtcaaatcaggaataagggcac attgccccggcgtgagtcggggcaatcccgcaaggagggtgaatgaatcggacgtttgaccggaaggcatacaggcaagaactgatcgacgcggggttttccgccg aggatgccgaaaccatcgcaagccgcaccgtcatgcgtgcgccccgcgaaaccttccagtccgtcggctcgatggtccagcaagctacggccaagatcgagcgcg acagcgtgcaactggctccccctgccctgcccgcgccatcggccgccgtggagcgttcgcgtcgtctcgaacaggaggcggcaggtttggcgaagtcgatgaccat cgacacgcgaggaactatgacgaccaagaagcgaaaaaccgccggcgaggacctggcaaaacaggtcagcgaggccaagcaggccgcgttgctgaaacacacg aagcagcagatcaaggaaatgcagctttccttgttcgatattgcgccgtggccggacacgatgcgagcgatgccaaacgacacggcccgctctgccctgttcaccacg cgcaacaagaaaatcccgcgcgaggcgctgcaaaacaaggtcattttccacgtcaacaaggacgtgaagatcacctacaccggcgtcgagctgcgggccgacgatg acgaactggtgtggcagcaggtgttggagtacgcgaagcgcacccctatcggcgagccgatcaccttcacgttctacgagctttgccaggacctgggctggtcgatca atggccggtattacacgaaggccgaggaatgcctgtcgcgcctacaggcgacggcgatgggcttcacgtccgaccgcgttgggcacctggaatcggtgtcgctgctg caccgcttccgcgtcctggaccgtggcaagaaaacgtcccgttgccaggtcctgatcgacgaggaaatcgtcgtgctgtttgctggcgaccactacacgaaattcatat gggagaagtaccgcaagctgtcgccgacggcccgacggatgttcgactatttcagctcgcaccgggagccgtacccgctcaagctggaaaccttccgcctcatgtgc ggatcggattccacccgcgtgaagaagtggcgcgagcaggtcggcgaagcctgcgaagagttgcgaggcagcggcctggtggaacacgcctgggtcaatgatgac ctggtgcattgcaaacgctagggccttgtggggtcagttccggctgggggttcagcagccagcgctttactggcatttcaggaacaagcgggcactgctcgacgcactt gcttcgctcagtatcgctcgggacgcacggcgcgctctacgaactgccgatagacaactgtcacggttaagcgagaaatgaataagaaggctgataattcggatctctg cgagggagatgatatttgatccggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcgctcggt cgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaa aaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc 
gacaggactataaagattaccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaag cgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagccgaccgctgcgcctt atccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctaca gagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccg gcaaacaaaccaccgctggtagcggtggtttttttgcaagcagcagattacgcgcagaaaaaaaggatatcaagaagatcctttgatcttttctacggggtctgacg ctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatata tgagtaaacttggtctgacagttaccaatgcttcatcagtgaggctgatcacaggcagcaacgctctgtcatcgttacaatcaacatgctaccctccgcgagatcatccgt gtttcaaacccggcagcttagttgccgttcttccgaatagcatcggtaacatgagcaaagtctgccgccttacaacggctctcccgctgacgccgtcccggactgatggg ctgcctgtatcgagtggtgattttgtgccgagctgccggtcggggagctgttggctggctggtggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaata acacaccgcggtctagaactagtggatcccccctacgtgcgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgttttctatcgc gtattaaatgtataattgcgggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaatttatatgataat catcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatccctcagaagaagtcgtcaagaaggcgatagaaggcgatgcgctgcg aatcgggagcggcgataccgtaaagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtcc gccacacccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgagatcctcgccgtc gggcatgcgcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgct cgctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcaggagcaa ggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtg 
gccagccacgatagccgcgctgcctcgtcctggagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaaca cggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatctgt taatcagaaaaactcagattaatcgacaaattcgatcgcacaaactagaaactaacaccagatctagatagaaatcacaaatcgaagagtaattattcgacaaaactcaa attatttgaacaaatcggatgatatttatgaaaccctaatcgagaattaagatgatatctaacgatcaaacccagaaaatcgtcttcgatctaagattaacagaatctaaacca aagaacatatacgaaattgggatcgaacgaaaacaaaatcgaagattttgagagaataaggaacacagaaatttaccttgatcacggtagagagaattgagagaaagtt tttaagattttgagaaattgaaatctgaattgtgaagaagaagagctctttgggtattgttttatagaagaagaagaagaaaagacgaggacgactaggtcacgagaaagc taaggcggtgaagcaatagctaataataaaatgacacgtgtattgagcgttgtttacacgcaaagttgtttttggctaattgccttatttttaggttgaggaaaagtatttgtgct ttgagttgataaacacgactcgtgtgtgccggctgcaaccactttgacgccgtttattactgactcgtcgacaaccacaatttctaacggtcgtcataagatccagccgttga gatttaacgatcgttacgattatatttttttagcattatcgttttattttttaaatatacggtggagctgaaaattggcaataattgaaccgtgggtcccactgcattgaagcgtatt tcgtattttctagaattcttcgtgctttatttcttttcctttttgtttttttttgccatttatctaatgcaagtgggcttataaaatcagtgaatttcttggaaaagtaacttctttatc gtataacatattgtgaaattatccatttcttttaattttttagtgttattggatatttttgtatgattatgatttgcataggataatgacttttgtatcaagttggtgaacaagtctcgt taaaaaaggcaagtggtttggtgactcgatttattcttgttatttaattcatatatcaatggatcttatttggggcctggtccatatttaacactcgtgttcagtccaatgaccaataat attttttcattaataacaatgtaacaagaatgatacacaaaacattctttgaataagttcgctatgaagaagggaacttatccggtcctagatcatcagttcatacaaacctccataga gttcaacatcttaaacaaggatatcctgatccgttgacggcgcgccaagcggccgcatttaaatgggccctcgagagcccaaatgcggccgcaaaacccctcacaaat acataaaaaaaattctttatttaattatcaaactctccactacctttcccaccaaccgttacaatcctgaatgttggaaaaaactaactacattgatataaaaaaactacattact tcctaaatcatatcaaaattgtataaatatatccactcaaaggagtctagaagatccacttggacaaattgcccatagttggaaagatgttcaccaagtcaacaagatttatc aatggaaaaatccatctaccaaacttactttcaagaaaatccaaggattatagagtaaaaaatctatgtattattaagtcaaaaagaaaaccaaagtgaacaaatattgatgt 
acaagtttgagaggataagacattggaatcgtctaaccaggaggcggaggaattccctagacagttaaaagtggccggaatcccggtaaaaaagattaaaatttttttgta gagggagtgcttgaatcatgttttttatgatggaaatagattcagcaccatcaaaaacattcaggacacctaaaattttgaagtttaacaaaaataacttggatctacaaaaat ccgtatcggattttctctaaatataactagaattttcataactttcaaagcaactcctcccctaaccgtaaaacttttcctacttcaccgttaattacattccttaagagtgataaa gaaataaagtaaataaaagtattcacaaaccaacaatttatttcttttatttacttaaaaaaacaaaaagtttatttattttacttaaatggcataatgacatatcggagatccctc gaacgagaatcttttatctccctggttttgtattaaaaagtaatttattgtggggtccacgcggagttggaatcctacagacgcgctttacatacgtctccgagaagcgtgacg gatgtgcgaccggatgaccctgtataacccaccgacacagccagcgcacagtatacacgtgtcatttctctattggaaaatgtcgttgttatccccgctggtacgcaacca ccgatggtgacaggtcgtctgttgtcgtgtcgcgtagcgggagaagggtctcatccaacgtattaaatactcgccttcaccgcgttacttctcatcttttctcttgcgttgtat aatcagtgcgatattctcagagagcttttcattcaaaggtatggagttttgaagggctttactcttaacatttgtttttctttgtaaattgttaatggtggtttctgtgggggaagaa tcttttgccaggtccttttgggtttcgcatgtttatttgggttatttttctcgactatggctgacattactagggctttcgtgctttcatctgtgttttcttcccttaataggtctgtct ctctggaatatttaattttcgtatgtaagttagagtagtcgctgtttgtaataggctcttgtctgtaaaggtttcagcaggtgtttgcgttttattgcgtcatgtgtttcagaaggcctt tgcagattattgcgttgtactttaatattttgtctccaaccttgttatagtttccctcctttgatctcacaggaaccctttcttctttgagcattttcttgtggcgttctgtagtaatat attttaattttgggcccgggttctgagggtaggtgattattcacagtgatgtgctttccctataaggtcctctatgtgtaagctgttagggtttgtgcgttactattgacatgtcacatg tcacatattttcttcctcttatccttcgaactgatggttctttttctaattcgtggattgctggtgccatattttatttctattgcaactgtattttagggtgtctctttctttttgatt tcttgttaatatttgtgttcaggttgtaactatgggttgctagggtgtctgccctcttcttttgtgcttctttcgcagaatctgtccgttggtctgtatttgggtgatgaattatttatt ccttgaagtatctgtctaattagcttgtgatgatgtgcaggtatattcgttagtcatatttcaatttcaagcgatcccccgggcccccatggatccagtagaaaccccaacccgtgaaat caaaaaactcgacggcctgtgggcattcagtctggatcgcgaaaactgtggaattggtcagcgttggtgggaaagcgcgttacaagaaagccgggcaattgctgtgccag gcagttttaacgatcagttcgccgatgcagatattcgtaattatgcgggcaacgtctggtatcagcgcgaagtctttataccgaaaggttgggcaggccagcgtatcgtgc 
tgcgtttcgatgcggtcactcattacggcaaagtgtgggtcaataatcaggaagtgatgggcatcagggcggctatacgccatttgaagccgatgtcacgccgtatgtt attgccgggaaaagtgtacgtaagtttctgcttctacctttgatatatatataataattatcattaattagtagtaatataatatttcaaatatttttttcaaaataaaagaatgtagta tatagcaattgcttttctgtagtttataagtgtgtatattttaatttataacttttctaatatatgaccaaaatttgttgatgtgcaggtatcaccgtttgtgtgaacaacgaactgaac tggcagactatcccgccgggaatggtgattaccgacgaaaacggcaagaaaaagcagtcttacttccatgatttctttaactatgccggaatccatcgcagcgtaatgctc tacaccacgccgaacacctgggtggacgatatcaccgtggtgacgcatgtcgcgcaagactgtaaccacgcgtctgttgactggcaggtggtggccaatggtgatgtc agcgttgaactgcgtgatgcggatcaacaggtggttgcaactggacaaggcactagcgggactttgcaagtggtgaatccgcacctctggcaaccgggtgaaggttat ctctatgaactgtgcgtcacagccaaaagccagacagagtgtgatatctacccgcttcgcgtcggcatccggtcagtggcagtgaagggcgaacagttcctgattaacc acaaaccgttctactttactggctttggtcgtcatgaagatgcggacttgcgtggcaaaggattcgataacgtgctgatggtgcacgaccacgcattaatggactggattg gggccaactcctaccgtacctcgcattacccttacgctgaagagatgctcgactgggcagatgaacatggcatcggtggtgattgatgaaactgctgctgtcggctttaacc tctctttaggcattggtttcgaagcgggcaacaagccgaaagaactgtacagcgaagaggcagtcaacggggaaactcagcaagcgcacttacaggcgattaaagag ctgatagcgcgtgacaaaaaccacccaagcgtggtgatgtggagtattgccaacgaaccggatacccgtccgcaaggtgcacgggaatatttcgcgccactggcgga agcaacgcgtaaactcgacccgacgcgtccgatcacctgcgtcaatgtaatgttctgcgacgctcacaccgataccatcagcgatctctttgatgtgctgtgcctgaacc gttattacggatggtatgtccaaagcggcgatttggaaacggcagagaaggtactggaaaaagaacttctggccggcaggagaaactgcatcagccgattacatcac cgaatacggcgtggatacgttagccgggctgcactcaatgtacaccgacatgtggagtgaagagtatcagtgtgcatggctggatatgtatcaccgcgtctttgatcgcg tcagcgccgtcgtcggtgaacaggtatggaatttcgccgattttgcgacctcgcaaggcatattgcgcgttggcggtaacaagaaagggatcttcactcgcgaccgcaa accgaagtcggcggcttttctgctgcaaaaacgctggactggcatgaacttcggtgaaaaaccgcagcagggaggcaaacaatgaatcaacaactctcctggcgcac catcgtcggctacagcctcgggaattgctaccgagggttcgaaatcgatgggtgttatttgtggataataaattcgggtgatgttcagtgtttgtcgtatttctcacgaataaa 
ttgtgtttatgtatgtgttagtgttgtttgtctgtttcagaccctcttatgttatatttttcttttcgtcggtcagttgaagccaatactggtgtcctggccggcactgcaataccattt cgtttaatataaagactctgttatccgtgagctcgaatttccccgatcgttcaaacatttggcaataaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatata atttctgttgaattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtccgcaattatacatttaatacgcgatagaaaa caaaatatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgcggccgcatttgggctcctgcaggtaccttaattaaaagtttaaactatc agtgtttgacaggatatattggcgggtaaacctaagagaaaagagcgtttattagaataatcggatatttaaaagggcgtgaaaaggtttatccgttcgtccatttgtatgtg catgccaaccacagggttccccagatc 60 pWVR219 cttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaac gccagcaacgcggccttttttacggttcctggcttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagct gataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgat tcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttat gcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagctgagagacataatgtggtttgtgtttcca tattgttcatctcccattgatcgtattaagaaagtatgatggtgatgtcgcagccttccgctttcgcttcacggaaaacctgaagcacactctcggcgccattttcagtcagct gcttgctttgttcaaactgcctccattccaaaacgagcgggtactccacccatccggtcagacaatcccataaagcgtccaggttttcaccgtagtattccggaagggcaa gctcctttttcaatgtctggtggaggtcgctgatacttctgatttgttccccgttaatgactgcttttttcatgtgcggctcctttcttatgattttattctatcagtttaccatttttc tcttcagaaatggccggattctgccccggtttcagctggacgatgtcctcctgtctcggacggctgctgcaaattggaccacattatggtctctcagcttgcatgccaaactttta attaaggtacctgcaggagcccgggctctcgagtaaaacataattttggcagtaaaaagtgaattctattgttttgaaaacaaaacaaaatacaggaagcgtgattgtggg gttgttgttgaacttgcccgggcaaaagaagaatgattagcggtagaggagttagtagttacgttcaactaaatgcgtgactaaattatttatcctccgccatggaagcagg tgattcacacacaacttgctgcacacattgctctcaaacctttcctataaatatccgtagcaggggctgcgatgatacacaacgcatttaatcaaactactttgattactttctg 
tgggttctactttctttgaatagtcagttctgctgtttttagaagatttatgagaatggccaaaattcaggtatcaaacgggaacatggcacaggttatcaacacgtttgacgg ggttgcggattatcttcagacatatcataagctacctgataattacattacaaaatcagaagcacaagccctcggctgggtggcatcaaaagggaaccttgcagacgtcg ctccggggaaaagcatcggcggagacatcttctcaaacagggaaggcaaactcccgggcaaaagcggacgaacatggcgtgaagcggatattaactatacatcagg cttcagaaattcagaccggattctttactcaagcgactggctgatttacaaaacaacggacgagtatcagacctttacaaaaatcagataacgaaaaaaacggcttccctg cgggaggccgtttttttcagctttacataaagtgtgtaataaatttttcttcaaactctgatcggtcaagagctcttctgagagacaatacatacatgtctctgatgttgtaacttt actaccaaaacctataaagattggcttatttcgttctattggatatgtatcatcattactggtaaatcaagtttctttctaataatgtagaagatcagaaaatccataagaagatat caacatttgagttctatggtaaattgaattatatcaacttagttgcaatgattcattcttgactgatgcattgatggcttatcaaaccagtttacaaaattcgattagatagggccc atttaaatgcggccgcttggcgcgcctgttaattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatcccc ctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatggcgcctgatgcggtattttctccttacgcat ctgtgcggtatttcacaccgcatatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagccccgacacccgccaacacccgctgacgcgccctgac gggcttgtctgctcccggcatccgcttacagacaagctgtgaccgtctccgggagctgcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcgagacgaaagg gcctcgtgatacgcctatttttataggttaatgtcatgataataatggtttcttagacgtcaggtggcactttcggggaaatgtgcgcggaacccctatttgtttatttttctaaat acattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttg cggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcg gtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccggcaagagcaactc ggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataacc atgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgg 
gaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactcta gcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccgg tgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaat agacagatcgctgagataggtgcctcactgataagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatcta ggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttc tgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagag cgcagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgct gccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagctt ggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttccgaagggagaaaggcggacaggtatccggtaagcggca gggtcggaacaggagagcgcacgagggag 61 pARB1002 cgccggcgttgtggatacctcgcggaaaacttggcctcactgacagatgaggggcggacgttgacacttgaggggccgactcacccggcgcggcgttgacagatg aggggcaggctcgatttcggccggcgacgtggagctggccagcctcgcaaatcggcgaaaacgcctgattttacgcgagtttcccacagatgatgtggacaagcctg gggataagtgccctgcggtattgacacttgaggggcgcgactactgacagatgaggggcgcgatccttgacacttgaggggcagagtgctgacagatgaggggcgc acctattgacatttgaggggctgtccacaggcagaaaatccagcatttgcaagggtttccgcccgtttttcggccaccgctaacctgtcttttaacctgcttttaaaccaatat ttataaaccttgtttttaaccagggctgcgccctgtgcgcgtgaccgcgcacgccgaaggggggtgcccccccttctcgaaccctcccggcccgctaacgcgggcctc ccatccccccaggggctgcgcccctcggccgcgaacggcctcaccccaaaaatggcagcgctggcagtccataattgtggtccaatttgcagccgtccgagacagg aggacatcgtccagctgaaaccggggcagaatccggccatttctgaagagaaaaatggtaaactgatagaataaaatcataagaaaggagccgcacatgaaaaaagc agtcattaacggggaacaaatcagaagtatcagcgacctccaccagacattgaaaaaggagcttgcccttccggaatactacggtgaaaacctggacgctttatgggat 
tgtctgaccggatgggtggagtacccgctcgttttggaatggaggcagtttgaacaaagcaagcagctgactgaaaatggcgccgagagtgtgcttcaggttttccgtga agcgaaagcggaaggctgcgacatcaccatcatactttcttaatacgatcaatgggagatgaacaatatggaaacacaaaccacaattgtggtttcaaaatcggctccgt cgatactatgttatacgccaactttgaaaacaactttgaaaaagctgttttctggtatttaaggttttagaatgcaaggaacagtgaattggagttcgtcttgttataattagcttc ttggggtatctttaaatactgtagaaaagaggaaggaaataataaatggctaaaatgagaatatcaccggaattgaaaaaactgatcgaaaaataccgctgcgtaaaaga tacggaaggaatgtctcctgctaaggtatataagctggtgggagaaaatgaaaacctatatttaaaaatgacggacagccggtataaagggaccacctatgatgtggaa cgggaaaaggacatgatgctatggctggaaggaaagctgcctgttccaaaggtcctgcactttgaacggcatgatggctggagcaatctgctcatgagtgaggccgat ggcgtcctttgctcggaagagtagaagatgaacaaagccctgaaaagattatcgagctgtatgcggagtgcatcaggctctttcactccatcgacatatcggattgtccc tatacgaatagcttagacagccgcttagccgaattggattacttactgaataacgatctggccgatgtggattgcgaaaactgggaagaagacactccatttaaagatccg cgcgagctgtatgattttttaaagacggaaaagcccgaagaggaacttgtcttttcccacggcgacctgggagacagcaacatctttgtgaaagatggcaaagtaagtgg ctttattgatcttgggagaagcggcagggcggacaagtggtatgacattgccttctgcgtccggtcgatcagggaggatatcggggaagaacagtatgtcgagctattttt tgacttactggggatcaagcctgattgggagaaaataaaatattatattttactggatgaattgttttagtacctagatgtggcgcaacgatgccggcgacaagcaggagc gcaccgacttcttccgcatcaagtgttttggctctcaggccgaggcccacggcaagtatttgggcaaggggtcgctggtattcgtgcagggcaagattcggaataccaa gtacgagaaggacggccagacggtctacgggaccgacttcattgccgataaggtggattatctggacacaccaaggcaccaggcgggtcaaatcaggaataagggcac attgccccggcgtgagtcggggcaatcccgcaaggagggtgaatgaatcggacgtttgaccggaaggcatacaggcaagaactgatcgacgcggggttttccgccg aggatgccgaaaccatcgcaagccgcaccgtcatgcgtgcgccccgcgaaaccttccagtccgtcggctcgatggtccagcaagctacggccaagatcgagcgcg acagcgtgcaactggctccccctgccctgcccgcgccatcggccgccgtggagcgttcgcgtcgtctcgaacaggaggcggcaggtttggcgaagtcgatgaccat cgacacgcgaggaactatgacgaccaagaagcgaaaaaccgccggcgaggacctggcaaaacaggtcagcgaggccaagcaggccgcgttgctgaaacacacg aagcagcagatcaaggaaatgcagctttccttgttcgatattgcgccgtggccggacacgatgcgagcgatgccaaacgacacggcccgctctgccctgttcaccacg 
cgcaacaagaaatcccgcgcgaggcgctgcaaaacaaggtcattttccacgtcaacaaggacgtgaagatcacctacaccggcgtcgagctgcgggccgacgatg acgaactggtgtggcagcaggtgttggagtacgcgaagcgcacccctatcggcgagccgatcaccttcacgttctacgagctttgccaggacctgggctggtcgatca atggccggtattacacgaaggccgaggaatgcctgtcgcgcctacaggcgacggcgatgggcttcacgtccgaccgcgttgggcacctggaatcggtgtcgctgctg caccgcttccgcgtcctggaccgtggcaagaaaacgtcccgttgccaggtcctgatcgacgaggaaatcgtcgtgctgtttgctggcgaccactacacgaaattcatat gggagaagtaccgcaagctgtcgccgacggccgacggatgttcgactatttcagctcgcaccgggagccgtacccgctcaagctggaaaccttccgcctcatgtgc ggatcggattccacccgcgtgaagaagtggcgcgagcaggtcggcgaagcctgcgaagagttgcgaggcagcggcctggtggaacacgcctgggtcaatgatgac ctggtgcattgcaaacgctagggccttgtggggtcagttccggctgggggttcagcagccagcgctttactggcatttcaggaacaagcgggcactgctcgacgcactt gcttcgctcagtatcgctcgggacgcacggcgcgctctacgaactgccgatagacaactgtcacggttaagcgagaaatgaataagaaggctgataattcggatctctg cgagggagatgatatttgatccggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcgctcggt cgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaa aaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc gacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatactgtccgcctttctcccttcgggaag cgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgcctt atccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctaca gagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccg gcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatatcaagaagatcctttgatcttttctacggggtctgacg ctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatata tgagtaaacttggtctgacagttaccaatgcttcatcagtgaggctgatcacaggcagcaacgctctgtcatcgttacaatcaacatgctaccctccgcgagatcatccgt 
gtttcaaacccggcagcttagttgccgttcttccgaatagcatcggtaacatgagcaaagtctgccgccttacaacggctctcccgctgacgccgtcccggactgatggg ctgcctgtatcgagtggtgattttgtgccgagctgccggtcggggagctgttggctggctggtggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaata acacaccgcggtctagaactagtggatcccccctacgtgcgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgttttctatcgc gtattaaatgtataattgcgggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaattatatgataat catcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatccctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcg aatcgggagcggcgataccgtaaagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtcc gccacacccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgagatcctcgccgtc gggcatgcgcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgct cgctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcaggagcaa ggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtg gccagccacgatagccgcgctgcctcgtcctggagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaaca cggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatctgt taatcagaaaaactcagattaatcgacaaattcgatcgcacaaactagaaactaacaccagatctagatagaaatcacaaatcgaagagtaattattcgacaaaactcaa attatttgaacaaatcggatgatatttatgaaaccctaatcgagaattaagatgatatctaacgatcaaacccagaaaatcgtcttcgatctaagattaacagaatctaaacca aagaacatatacgaaattgggatcgaacgaaaacaaaatcgaagattttgagagaataaggaacacagaaatttaccttgatcacggtagagagaattgagagaaagtt tttaagattttgagaaattgaaatctgaattgtgaagaagaagagctctttgggtattgttttatagaagaagaagaagaaaagacgaggacgactaggtcacgagaaagc taaggcggtgaagcaatagctaataataaaatgacacgtgtattgagcgttgtttacacgcaaagttgtttttggctaattgccttatttttaggttgaggaaaagtatttgtgct 
ttgagttgataaacacgactcgtgtgtgccggctgcaaccactttgacgccgtttattactgactcgtcgacaaccacaatttctaacggtcgtcataagatccagccgttga gatttaacgatcgttacgatttatatttttttagcattatcgtttattttttaaatatacggtggagctgaaaattggcaataattgaaccgtgggtcccactgcattgaagcgtatt tctattttctagaattcttcgtgctttatttcttttccttttcctttttgtttttttttgccatttatctaatgcaagtgggcttataaaatcagtgaatttcttggaaaagtaacttct ttatcgtataacatattgtgaaattatccatttcttttaattttttagtgttattggatatttttgtatgattattgatttgcataggataatgacttttgtatcaagttggtgaacaag tctcgttaaaaaaggcaagtggtttggtgactcgatttattcttgttatttaattcatatatcaatggatcttatttggggcctggtccatatttaacactcgtgttcagtccaatgacc aataatattttttcattaataacaatgaacaagaatgatacacaaaacattctttgaataagttcgctatgaagaagggaacttatccggtcctagatcatcagttcatacaaacctcca tagagttcaacatcttaaacaaggatatcctgatccgttgacggcgcgccaagcggggccgcatttaaatgggccctatctaatcgaattttgtaactggtttgataagccatca atgcatcagtcaagaatgaatcattgcaactaagttgatataattcaatttaccatagaactcaaatgttgatatcttcttatggattttctgatcttctacattattagaaagaaac ttgatttaccagtaatgatgatacatatccaatagaacgaaataagccaatctttataggttttggtagtaaagttacaacatcagagacatgtatgtattgtctctcagaagag ctcttgaccgatcagagtttgaagaaaaatttattacacactttatgtaaagctgaaaaaaacggcctcccgcagggaagccgtttttttcgttatctgatttttgtaaaggtct gatactcgtccgttgttttgtaaatcagccagtcgcttgagtaaagaatccggtctgaatttctgaagcctgatgtatagttaatatccgcttcacgccatgttcgtccgcttttg cccgggagtttgccttccctgtttgagaagatgtctccgccgatgcttttccccggagcgacgtctgcaaggttcccttttgatgccacccagccgagggcttgtgcttctg attttgtaatgtaattatcaggtagcttatgatatgtctgaagataatccgcaacccgtcaaacgtgttgataacctgtgccatgttcccgtttgatacctgaattttggccattc tcataaatcttctaaaaacagcagaactgactattcaaagaaagtagaacccacagaaagtaatcaaagtagtttgattaaatgcgttgtgtatcatcgcagcccctgctac ggatatttataggaaaggtttgagagcaatgtgtgcagcaagttgtgtgtgaatcacctgcttccatggcggaggataaataatttagtcacgcatttagttgaacgtaacta ctaactcctctaccgctaatcattcttcttttgcccgggcaagttcaacaacaaccccacaatcacgcttcctgtatttttgttttgttttcaaaacaatagaattcactttttactgc caaaattatgttttactcgagagcccaaatgcggccgcaaaacccctcacaaatacataaaaaaaattctttatttaattatcaaactctccactacctttcccaccaaccgtt 
acaatcctgaatgttggaaaaaactaactacattgatataaaaaaactacattacttcctaaatcatatcaaaattgtataaatatatccactcaaaggagtctagaagatcca cttggacaaattgcccatagttggaaagatgttcaccaagtcaacaagatttatcaatggaaaaatccatctaccaaacttactttcaagaaaatccaaggattatagagtaa aaaatctatgtattattaagtcaaaaagaaaaccaaagtgaacaaatattgatgtacaagtttgagaggataagacattggaatcgtctaaccaggaggcggaggaattcc ctagacagttaaaagtggccggaatcccggtaaaaaagattaaaatttttttgtagagggagtgcttgaatcatgttttttatgatggaaatagattcagcaccatcaaaaac attcaggacacctaaaattttgaagtttaacaaaaataacttggatctacaaaaatccgtatcggattttctctaaatataactagaattttcataactttcaaagcaactcctcc cctaaccgtaaaacttttcctacttcaccgttaattacattccttaagagtgataaagaaataaagtaaataaaagtattcacaaaccaacaatttatttcttttatttacttaaaaa aacaaaaagtttatttattttacttaaatggcataatgacatatcggagatccctcgaacgagaatcttttatctccctggttttgtattaaaaagtaatttattgtggggtccacg cggagttggaatcctacagacgcgctttacatacgtctcgagaagcgtgacggatgtgcgaccggatgaccctgtataacccaccgacacagccagcgcacagtatac acgtgtcatttctctattggaaaatgtcgttgttatccccgctggtacgcaaccaccgatggtgacaggtcgtctgttgtcgtgtcgcgtagcgggagaagggtctcatcca acgctattaaatactcgccttcaccgcgttacttctcatcttttctcttgcgttgtataatcagtgcgatattctcagagagcttttcattcaaaggtatggagttttgaagggcttt actcttaacatttgtttttctttgtaaattgttaatggtggtttctgtgggggaagaatcttttgccaggtccttttgggtttcgcatgtttatttgggttatttttctcgactatggct gacattactagggctttcgtgctttcatctgtgttttcttcccttaataggtctgtctctctggaatatttaattttcgtatgtaagttatgagtagtcgctgtttgtaataggctcttg tctgtaaaggtttcagcaggtgtttgcgttttattgcgtcatgtgtttcagaaggcctttgcagattattgcgttgtactttaatattttgtctccaaccttgttatagtttccctcctt tgatctcacaggaaccctttcttctttgagcttttcttgtggcgttctgtagtaatattttaatttgggcccgggttctgagggtaggtgattattcacagtgatgtgctttccctataa ggtcctctatgtgtaagctgttagggtttgtgcgttactattgacatgtcacatgtcacatattttcttcctcttatccttcgaactgatggttctttttctaattcgtggattgctggt gccatattttatttctattgcaactgtattttagggtgtctctttcttttgatttcttgttaatatttgtgttcaggttgtaactatgggttgctagggtgtctgccctcttcttttgtg cttctttcgcagaatctgtccgttggtctgtatttgggtgatgaattatttattccttgaagtatctgtctaattagcttgtgatgatgtgcaggtatattcgttagtcatatttcaatt 
tcaagcgatcccccgggcccccatggatccagtagaaaccccaacccgtgaaatcaaaaaactcgacggcctgtgggcattcagtctggatcgcgaaaactgtggaattggtc agcgttggtgggaaagcgcgttacaagaaagccgggcaattgctgtgccaggcagttttaacgatcagttcgccgatgcagatattcgtaattatgcgggcaacgtctg gtatcagcgcgaagtctttataccgaaaggttgggcaggccagcgtatcgtgctgcgtttcgatgcggtcactcattacggcaaagtgtgggtcaataatcaggaagtga tggagcatcagggcggctatacgccatttgaagccgatgtcacgccgtatgttattgccgggaaaagtgtacgtaagtttctgcttctacctttgatatatatataataattatc attaattagtagtaatataatatttcaaatatttttttcaaaataaaagaatgtagtatatagcaattgcttttctgtagtttataagtgtgtatattttaatttataacttttctaata tatgaccaaaatttgttgatgtgcaggtatcaccgtttgtgtgaacaacgaactgaactggcagaactatcccgccgggaatggtgattaccgacgaaaacggcaagaaaaagc agtcttacttccatgatttctttaactatgccggaatccatcgcagcgtaatgctctacaccacgccgaacacctgggtggacgatatcaccgtggtgacgcatgtcgcgc aagactgtaaccacgcgtctgttgactggcaggtggtggccaatggtgatgtcagcgttgaactgcgtgatgcggatcaacaggtggttgcaactggacaaggcactag cgggactttgcaagtggtgaatccgcacctctggcaaccgggtgaaggttatctctatgaactgtgcgtcacagccaaaagccagacagagtgtgatatctacccgcttc gcgtcggcatccggtcagtggcagtgaagggcgaacagttcctgattaaccacaaccgttctactttactggctttggtcgtcatgaagatgcggacttgcgtggcaaa ggattcgataacgtgctgatggtgcacgaccacgcattaatggactggattggggccaactcctaccgtacctcgcattacccttacgctgaagagatgctcgactgggc agatgaacatggcatcgtggtgattgatgaaactgctgctgtcggctttaacctctctttaggcattggtttcgaagcgggcaacaagccgaaagaactgtacagcgaag aggcagtcaacggggaaactcagcaagcgcacttacaggcgattaaagagctgatagcgcgtgacaaaaaccacccaagcgtggtgatgtggagtatttgccaacga accggatacccgtccgcaaggtgcacgggaatatttcgcgccactggcggaagcaacgcgtaaactcgacccgacgcgtccgatcacctgcgtcaatgtaatgttctg cgacgctcacaccgataccatcagcgatctctttgatgtgctgtgcctgaaccgttattacggatggtatgtccaaagcggcgatttggaaacggcagagaaggtactgg aaaaagaacttctggcctggcaggagaaactgcatcagccgattatcatcaccgaatacggcgtggatacgttagccgggctgcactcaatgtacaccgacatgtgga gtgaagagtatcagtgtgcatggctggatatgtatcaccgcgtctttgatcgcgtcagcgccgtcgtcggtgaacaggtatggaatttcgccgattttgcgacctcgcaag 
gcatattgcgcgttggcggtaacaagaaagggatcttcactcgcgaccgcaaaccgaagtcggcggcttttctgctgcaaaaacgctggactggcatgaacttcggtga aaaaccgcagcagggaggcaaacaatgaatcaacaactctcctggcgcaccatcgtcggctacagcctcgggaattgctaccggggttcgaaatcgatgggtgttattt gtggataataaattcgggtgatgttcagtgtttgtcgtatttctcacgaataaattgtgtttatgtatgtgttagtgttgtttgtctgtttcagaccctcttatgttatatttttctttt cgtcggtcagttgaagccaatactggtgtcctggccggcactgcaataccatttcgtttaatataaagactctgttatccgtgagctcgaatttccccgatcgttcaaacattttggc aataaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgaga tgggtttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagat cgcggccgcatttgggctcctgcaggtaccttaattaaaagtttaaactatcagtgtttgacaggatatattggcgggtaaacctaagagaaaagagcgtttatagaataat cggatatttaaaagggcgtgaaaaggtttatccgttcgtccatttgtatgtgcatgccaaccacagggttccccagatc 62 pWVCZ24 cgccggcgtt gtggatacct cgcggaaaac ttggccctca ctgacagatg aggggcggac    60 gttgacactt gaggggccga ctcacccggc gcggcgttga cagatgaggg gcaggctcga   120 tttcggccgg cgacgtggag ctggccagcc tcgcaaatcg gcgaaaacgc ctgattttac   180 gcgagtttcc cacagatgat gtggacaagc ctggggataa gtgccctgcg gtattgacac   240 ttgaggggcg cgactactga cagatgaggg gcgcgatcct tgacacttga ggggcagagt   300 gctgacagat gaggggcgca cctattgaca tttgaggggc tgtccacagg cagaaaatcc   360 agcatttgca agggtttccg cccgtttttc ggccaccgct aacctgtctt ttaacctgct   420 tttaaaccaa tatttataaa ccttgttttt aaccagggct gcgccctgtg cgcgtgaccg   480 cgcacgccga aggggggtgc ccccccttct cgaaccctcc cggcccgcta acgcgggcct   540 cccatccccc caggggctgc gcccctcggc cgcgaacggc ctcaccccaa aaatggcagc   600 gctggcagtc cataattgtg ggctgagaga cataattgtg gtttgtgttt ccatattgtt   660 catctcccat tgatcgtatt aagaaagtat gatggtgatg tcgcagcctt ccgctttcgc   720 ttcacggaaa acctgaagca cactctcggc gccattttca gtcagctgct tgctttgttc   780 aaactgcctc cattccaaaa cgagcgggta ctccacccat ccggtcagac aatcccataa   840 agcgtccagg ttttcaccgt agtattccgg aagggcaagc tcctttttca atgtctggtg   900 gaggtcgctg 
atacttctga tttgttcccc gttaatgact gcttttttca tgtgcggctc   960 ctttcttatg attttattct atcagtttac catttttctc ttcagaaatg gccggattct  1020 gccccggttt cagctggacg atgtcctcct gtctcggacg gctgctgcaa attggaccac  1080 attatggtct ctcccataat tgtggtttca aaatcggctc cgtcgatact atgttatacg  1140 ccaactttga aaacaacttt gaaaaagctg ttttctggta tttaaggttt tagaatgcaa  1200 ggaacagtga attggagttc gtcttgttat aattagcttc ttggggtatc tttaaatact  1260 gtagaaaaga ggaaggaaat aataaatggc taaaatgaga atatcaccgg aattgaaaaa  1320 actgatcgaa aaataccgct gcgtaaaaga tacggaagga atgtctcctg ctaaggtata  1380 taagctggtg ggagaaaatg aaaacctata tttaaaaatg acggacagcc ggtataaagg  1440 gaccacctat gatgtggaac gggaaaagga catgatgcta tggctggaag gaaagctgcc  1500 tgttccaaag gtcctgcact ttgaacggca tgatggctgg agcaatctgc tcatgagtga  1560 ggccgatggc gtcctttgct cggaagagta tgaagatgaa caaagccctg aaaagattat  1620 cgagctgtat gcggagtgca tcaggctctt tcactccatc gacatatcgg attgtcccta  1680 tacgaatagc ttagacagcc gcttagccga attggattac ttactgaata acgatctggc  1740 cgatgtggat tgcgaaaact gggaagaaga cactccattt aaagatccgc gcgagctgta  1800 tgatttttta aagacggaaa agcccgaaga ggaacttgtc ttttcccacg gcgacctggg  1860 agacagcaac atctttgtga aagatggcaa agtaagtggc tttattgatc ttgggagaag  1920 cggcagggcg gacaagtggt atgacattgc cttctgcgtc cggtcgatca gggaggatat  1980 cggggaagaa cagtatgtcg agctattttt tgacttactg gggatcaagc ctgattggga  2040 gaaaataaaa tattatattt tactggatga attgttttag tacctagatg tggcgcaacg  2100 atgccggcga caagcaggag cgcaccgact tcttccgcat caagtgtttt ggctctcagg  2160 ccgaggccca cggcaagtat ttgggcaagg ggtcgctggt attcgtgcag ggcaagattc  2220 ggaataccaa gtacgagaag gacggccaga cggtctacgg gaccgacttc attgccgata  2280 aggtggatta tctggacacc aaggcaccag gcgggtcaaa tcaggaataa gggcacattg  2340 ccccggcgtg agtcggggca atcccgcaag gagggtgaat gaatcggacg tttgaccgga  2400 aggcatacag gcaagaactg atcgacgcgg ggttttccgc cgaggatgcc gaaaccatcg  2460 caagccgcac cgtcatgcgt gcgccccgcg aaaccttcca gtccgtcggc tcgatggtcc  2520 agcaagctac ggccaagatc gagcgcgaca gcgtgcaact ggctccccct gccctgcccg  
2580 cgccatcggc cgccgtggag cgttcgcgtc gtctcgaaca ggaggcggca ggtttggcga  2640 agtcgatgac catcgacacg cgaggaacta tgacgaccaa gaagcgaaaa accgccggcg  2700 aggacctggc aaaacaggtc agcgaggcca agcaggccgc gttgctgaaa cacacgaagc  2760 agcagatcaa ggaaatgcag ctttccttgt tcgatattgc gccgtggccg gacacgatgc  2820 gagcgatgcc aaacgacacg gcccgctctg ccctgttcac cacgcgcaac aagaaaatcc  2880 cgcgcgaggc gctgcaaaac aaggtcattt tccacgtcaa caaggacgtg aagatcacct  2940 acaccggcgt cgagctgcgg gccgacgatg acgaactggt gtggcagcag gtgttggagt  3000 acgcgaagcg cacccctatc ggcgagccga tcaccttcac gttctacgag ctttgccagg  3060 acctgggctg gtcgatcaat ggccggtatt acacgaaggc cgaggaatgc ctgtcgcgcc  3120 tacaggcgac ggcgatgggc ttcacgtccg accgcgttgg gcacctggaa tcggtgtcgc  3180 tgctgcaccg cttccgcgtc ctggaccgtg gcaagaaaac gtcccgttgc caggtcctga  3240 tcgacgagga aatcgtcgtg ctgtttgctg gcgaccacta cacgaaattc atatgggaga  3300 agtaccgcaa gctgtcgccg acggcccgac ggatgttcga ctatttcagc tcgcaccggg  3360 agccgtaccc gctcaagctg gaaaccttcc gcctcatgtg cggatcggat tccacccgcg  3420 tgaagaagtg gcgcgagcag gtcggcgaag cctgcgaaga gttgcgaggc agcggcctgg  3480 tggaacacgc ctgggtcaat gatgacctgg tgcattgcaa acgctagggc cttgtggggt  3540 cagttccggc tgggggttca gcagccagcg ctttactggc atttcaggaa caagcgggca  3600 ctgctcgacg cacttgcttc gctcagtatc gctcgggacg cacggcgcgc tctacgaact  3660 gccgatagac aactgtcacg gttaagcgag aaatgaataa gaaggctgat aattcggatc  3720 tctgcgaggg agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac  3780 atgctaccct ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc  3840 gaatagcatc ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc  3900 gtcccggact gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg  3960 gagctgttgg ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca  4020 acttaataac acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg  4080 agacgggcaa cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt  4140 ccacgctggt ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa  4200 aatcccttat aaatcaaaag aatagcccga gatagggttg 
agtgttgttc cagtttggaa  4260 caagagtcca ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca  4320 gggcgatggc ccacggccgc tctagaacta gtggatccac cagaaccacc accagagccg  4380 ccgccagcat tgacaggagg cccgatctag taacatagat gacaccgcgc gcgataattt  4440 atcctagttt gcgcgctata ttttgttttc tatcgcgtat taaatgtata attgcgggac  4500 tctaatcata aaaacccatc tcataaataa cgtcatgcat tacatgttaa ttattacatg  4560 cttaacgtaa ttcaacagaa attatatgat aatcatcgca agaccggcaa caggattcaa  4620 tcttaagaaa ctttattgcc aaatgtttga acgatcgggg atcatccggg tctgtggcgg  4680 gaactccacg aaaatatccg aacgcagcaa gatatcgcgg tgcatctcgg tcttgcctgg  4740 gcagtcgccg ccgacgccgt tgatgtggac gccgggcccg atcatattgt cgctcaggat  4800 cgtggcgttg tgcttgtcgg ccgttgctgt cgtaatgata tcggcacctt cgaccgcctg  4860 ttccgcagag atcccgtggg cgaagaactc cagcatgaga tccccgcgct ggaggatcat  4920 ccagccggcg tcccggaaaa cgattccgaa gcccaacctt tcatagaagg cggcggtgga  4980 atcgaaatct cgtgatggca ggttgggcgt cgcttggtcg gtcatttcga accccagagt  5040 cccgctcaga agaactcgtc aagaaggcga tagaaggcga tgcgctgcga atcgggagcg  5100 gcgataccgt aaagcacgag gaagcggtca gcccattcgc cgccaagctc ttcagcaata  5160 tcacgggtag ccaacgctat gtcctgatag cggtccgcca cacccagccg gccacagtcg  5220 atgaatccag aaaagcggcc attttccacc atgatattcg gcaagcaggc atcgccatgg  5280 gtcacgacga gatcatcgcc gtcgggcatg cgcgccttga gcctggcgaa cagttcggct  5340 ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat cgacaagacc ggcttccatc  5400 cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt cgaatgggca ggtagccgga  5460 tcaagcgtat gcagccgccg cattgcatca gccatgatgg atactttctc ggcaggagca  5520 aggtgagatg acaggagatc ctgccccggc acttcgccca atagcagcca gtcccttccc  5580 gcttcagtga caacgtcgag cacagctgcg caaggaacgc ccgtcgtggc cagccacgat  5640 agccgcgctg cctcgtcctg cagttcattc agggcaccgg acaggtcggt cttgacaaaa  5700 agaaccgggc gcccctgcgc tgacagccgg aacacggcgg catcagagca gccgattgtc  5760 tgttgtgccc agtcatagcc gaatagcctc tccacccaag cggccggaga acctgcgtgc  5820 aatccatctt gttcaatcat gcgaaacgat ccagatccgg tgcagattat ttggattgag  5880 agtgaatatg agactctaat 
tggataccga ggggaattta tggaacgtca gtggagcatt  5940 tttgacaaga aatatttgct agctgatagt gaccttaggc gacttttgaa cgcgcaataa  6000 tggtttctga cgtatgtgct tagctcatta aactccagaa acccgcggct gagtggctcc  6060 ttcaacgttg cggttctgtc agttccaaac gtaaaacggc ttgtcccgcg tcatcggcgg  6120 gggtcataac gtgactccct taattctccg ctcatgatca gattgtcgtt tcccgccttc  6180 agtttaaact atcagtgttg cggccgcggc gcgccttccc gatctagtaa catagatgac  6240 accgcgcgcg ataatttatc ctagtttgcg cgctatattt tgttttctat cgcgtattaa  6300 atgtataatt gcgggactct aatcataaaa acccatctca taaataacgt catgcattac  6360 atgttaatta ttacatgctt aacgtaattc aacagaaatt atatgataat catcgcaaga  6420 ccggcaacag gattcaatct taagaaactt tattgccaaa tgtttgaacg atcggggaaa  6480 ttcgagctca aagtgcaatt gaccgatcag agtttgaaga aaaatttatt acacacttta  6540 tgtaaagctg aaaaaaacgg cctcccgcag ggaagccgtt tttttcgtta tctgattttt  6600 gtagaggtct gataatggtc cgttgttttg taaatcagcc agtcgcttga gtaaagaatc  6660 cggtctgaat ttctgaagcc tgatgtatag ttaatatccg cttcacgcca tgttcgtccg  6720 cttttgcccg ggagtttgcc ttccctgttt gagaagatgt ctccgccgat gcttttcccc  6780 ggagcgacgt ctgcaaggtt cccttttgat gccacccagc cgagggcttg tgcttctgat  6840 tttgtaatgt aattatcagg tagcttatga tatgtctgaa gataatccgc aaccccgtca  6900 aacgtgttga taacctgtgc catgatttgt acacaaaatt tccgcgcaca gatcctcaca  6960 gcgtatgcaa aacaaagctg caactactaa taccagtcca aaagcaatgg gcgcaacagc  7020 aacagcaaaa gctgcaaccc cttgtgctgg ttcgttccta cagttggacg cagcccgagt  7080 tctgagaaac aaataaccac aaggcaagtt aggtaccaaa ccccttaagc tcaacttaag  7140 caaatattac aatcgtttgt ttctacaaac aaatcttttt cagaacggct tcaggtgggg  7200 aatattgtcc atttaagtac ctgaaaatct aagaacacgg ccaatccggg cgcctttgct  7260 tgaaagtggg aagaaacctg aatgattgaa cagtggataa gagatttata agcaagatta  7320 gcagggctga tcagattgtt ttttcgggta ggttgatcaa tacatatgcc ccttccctct  7380 tcctttcctc tacaatcgat tgccagggag agatagagat accatcatga tgatgatggt  7440 ggggatggcg atgatggtaa tgatgatgat ccagcagaaa aaattgcgca gaagaagaag  7500 atgagcggtc ggtcggtcga tagcctttca gtcggagggg aaagaacaaa ataatgccta  7560 
tttgaaggca gatggattga ctaagacgtg tgcaggcagt ggaggagtta caaggcagga  7620 catatttact aggtataggt gtaggtaata gtaatggaga ggataaattt aggttttggg  7680 atgaatggat ttgttggtac atgttgcaac tcccacactg caatcaaagg accgctatga  7740 caccccctga atgcgacgcc catgagaatg ccgaccccac atatacattt ctggaaataa  7800 tagggaaatg cacccttgca ttatatttca tttattcgtc ctccattttg tgcgctctcc  7860 attcattttc aaatgcgctc cactcttcct ttatttctta ccaccattat ctcgtattcg  7920 aggtccagaa atcaagttgt gaatctgcct tggttgcgca ttgttaaagt actcttctgt  7980 gtatatttct gccccaccgt tttcacttcc aacacttaaa tttttttatt ttttatttta  8040 tatatttctt ataaattgtt ggcttctcac acgaacccaa gccatccaag ccccgacaaa  8100 ggcaatccaa tgtacttgac tagagtcaaa taccttttac ttctttactt ctcatattac  8160 ccagaagcca agccaacctt accaaactaa tgtacctgag cagagtccac tacctttcct  8220 caagtacagt ggcagtcaga gtatatcacc gcttgttatg tatatgcttt aatgctatgc  8280 ttatttctag gtcataatct aaatcatatt tgctgtcgag tttaagctta tcgataccgt  8340 cgacctcgag cttcttcttg aatgctctta tgggtaggat tatttttcac ttttttcctt  8400 catattccac acacatatat atataaacac actaacatta gtgggaatat ttgtttgata  8460 tgtttatttt atttacttcg ggggtttttg taacaatttt gtagatctaa tttcttgtct  8520 tcatgtgtat attaattttc ccttaagact taaataaaaa gagagagttt gttatatata  8580 gatatatgaa gtgagggaaa tggtacaaag ttaaaggaga tctgagtgag agttagataa  8640 taaatgaaaa gaaataagaa accatcaggg ttttttctaa tgtggagttt tagattcagt  8700 tttgtagaac taagattcac tttgttgggt gttctttctt cactcatttc tgttattata  8760 ataataataa aatcttatat ctttctattt tccttactaa caagtacttg aagatttaga  8820 tatatttata gatctggtgt tgtaataggt aaaaacttga tttttatgac tataaaagta  8880 agttttggga aacaaattgg ggagagagta aggaaggact atgaggtcat atcttctgtt  8940 ttgtgatcat ccatcctcca ttgttgttaa tgtctgtgtc tctctttttc ttctcttctt  9000 tctcttactt tcctttctta tctctagctc tctttctctc tcatgaatta tatcatatca  9060 tatatttgat acaaacacat gtgatggtaa gtgagagtga ataaggtgaa actagctaga  9120 tttttgagtt ttcatgaaat tttaacttat atgagtgata gaaaataatg gaacttatac  9180 gtacatgtag gacaatttag atggttatct aagtttttgt ttttgttttc 
tcttgagaat  9240 gttaaatgtt agtgttattt ttgtagtttt ggaaaattat atatgagcta agattagttt  9300 agaagtggtc aaaagaaaca tagatttgaa atttcaactg aattttcaag atttcaaata  9360 gtcaatgaaa caaggaggta attaagacaa attagcttat ggggactctt ttttgttatt  9420 ccttaaaatt actcttttta aaattaaaaa taactaatct catttcgaac tacattactc  9480 aaactagtaa tctctaattc gacacgcaat ttccaaatac ttattagtag agagtcccac  9540 gtgattactt tcttctccac caaaacataa aacatgtcaa gattaaatgg tgtttgaaaa  9600 ttaaaagatc aattttctta atcgtttaca gttgtcaact ctcatgtcct gaaatatata  9660 attctcatgt ccaaaacaag aaaagctaac aacgacttca aattaaatca gtcaatcaaa  9720 attagtcttc atttacctac taatttcttt ttatatatcc gatgggtact ctacgaaatc  9780 agagtttcgt ttctttattt attttctttt ataagatttt tgaggttttt tcagaggttg  9840 gaattgagcg caagattagg ttttgggtct gtaagatttg ttgtctttgt taaagaatct  9900 ttgatcacgt catcactcag atattatttc tttttatttt tcatttgtat ttttactaat  9960 ttattataaa gttttgttag tttcagttct tgacttctga caagaaggtt ttatgtcata 10020 atgaattaat ttgtaaccta tttataaatt caaaaatgtc atcatattac tacttttgac 10080 catttaatat tagatttctc atttggtcaa tacccaatgt tcatattaca tatatagaga 10140 caaaaattat aaggatacta aattgttcat atttcttgga agtaaaaaga ttaatgatca 10200 ctgaataaat agatttggca tagaagtata gcattggaat tgcttcaaca tctttggtgt 10260 agatagattt atgcaatttc tctttctttt tgaagtatct ttttttttct agagagagaa 10320 taatgttagg gatttttatc attttctctc tcattatggg tactgagagg aaagtgagat 10380 ttttagtacg gatccaatag tttaagagtt tggtctgcct tctacgatcc aaaaaaatct 10440 acggtcatga tctctccatc gagaaggttg agagttcaga catcaaagtc tataatatgt 10500 cattgtaata cgtatttgtg catatatatc tatgtacaag tacatataca ggaaactcaa 10560 gaaaaaagaa taaatggtaa atttaattat attccaaata aggaaagtat ggaacgttgt 10620 gatgttactc ggacaagtca tttagttaca tccatcacgt ttaaatttaa tccaatggtt 10680 acaattttaa tactatcaaa tgtctattgg atttataccc aatgtgttaa tgggttgttg 10740 acacatgtca catgtctgaa accctagaca tgttcagacc aatcatgtca ctctaatttt 10800 gccagcatgg cagttggcag ccaatcacta gctcgataaa tttaaggttt cagaggaatt 10860 ttaatttatt tagggttcat attgtttcat 
aaaatgattc tttatttgtt acaactttaa 10920 ggaaatattt tattaactat ttaattgttc ccttttctta tattactttt gttttttctt 10980 cacatcatgt gtcacattaa gttgcatttc ttctgactca aaagaaccga tgtttgcttt 11040 taaggtttcg tattagaatc acttaactgt gcaagtggtc gatttgaccc tatcaagctt 11100 gatatcgaat tcctgcagcc cgggctcctg caggtacctt aattaaaagt ttaaactatc 11160 agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt attagaataa 11220 tcggatattt aaaagggcgt gaaaaggttt atccgttcgt ccatttgtat gtgcatgcca 11280 accacagggt tccccagatc 63 pARB1005L cgccggcgttgtggatacctcgcggaaaacttggccctcactgacagatgaggggcggacgttgacacttgaggggccgactcacccggcgcggcgttgacagatg aggggcaggctcgatttcggccggcgacgtggagctggccagcctcgcaaatcggcgaaaacgcctgattttacgcgagtttcccacagatgatgtggacaagcctg gggataagtgccctgcggtattgacacttgaggggcgcgactactgacagatgagggcgcgatccttgacacttgaggggcagagtgctgacagatgaggggcgc acctattgacatttgaggggctgtccacaggcagaaaatccagcatttgcaagggtttccgcccgtttttcggccaccgctaacctgtcttttaacctgcttttaaaccaatat ttataaaccttgtttttaaccagggctgcgccctgtgcgcgtgaccgcgcacgccgaaggggggtgcccccccttctcgaacctcccggcccgctaacgcgggcctc ccatccccccaggggctgcgcccctcggccgcgaacggcctcaccccaaaaatggcagcgctggcagtccataattgtggtccaatttgcagccgtccgagacagg aggacatcgtccagctgaaaccggggcagaatccggccatttctgaagagaaaaatggtaaactgatagaataaaatcataagaaaggagccgcacatgaaaaaagc agtcattaacggggaacaaatcagaagtatcagcgacctccaccagacattgaaaaaggagcttgcccttccggaatactacggtgaaaacctggacgctttatgggat tgtctgaccggatgggtggagtacccgctcgttttggaatggaggcagtttgaacaaagcaagcagctgactgaaaatggcgccgagagtgtgcttcaggttttccgtga agcgaaagcggaaggctgcgacatcaccatcatactttcttaatacgatcaatgggagatgaacaatatggaaacacaaaccacaattgtggtttcaaaatcggctccgt cgatactatgttatacgccaactttgaaaacaactttgaaaaagctgttttctggtatttaaggttttagaatgcaaggaacagtgaattggagttcgtcttgttataattagcttc ttggggtatctttaaatactgtagaaaagaggaaggaaataataaatggctaaaatgagaatatcaccggaattgaaaaaactgatcgaaaaataccgctgcgtaaaaga tacggaaggaatgtctcctgctaaggtatataagctggtgggagaaaatgaaaacctatatttaaaaatgacggacagccggtataaagggaccacctatgatgtggaa 
cgggaaaaggacatgatgctatggctggaaggaaagctgcctgttccaaaggtcctgcactttgaacggcatgatggctggagcaatctgctcatgagtgaggccgat ggcgtcctttgctcggaagagtatgagatgaacaaagccctgaaaagattatcgagctgtatgcggagtgcatcaggctctttcactccatcgacatatcggattgtccc tatacgaatagcttagacagccgcttagccgaattggattacttactgaataacgatctggccgatgtggattgcgaaaactgggaagaagacactccatttaaagatccg cgcgagctgtatgattttttaaagacggaaaagcccgaagaggaacttgtcttttcccacggcgacctgggagacagcaacatctttgtgaaagatggcaaagtaagtgg ctttattgatcttgggagaagcggcagggcggacaagtggtatgacattgccttctgcgtccggtcgatcagggaggatatcggggaagaacagtatgtcgagctattttt tgacttactggggatcaagcctgattgggagaaaataaaatattatattttactggatgaattgttttagtacctagatgtggcgcaacgatgccggcgacaagcaggagc gcaccgacttcttccgcatcaagtgttttggctctcaggccgaggcccacggcaagtattgggcaaggggtcgctggtattcgtgcagggcaagattcggaataccaa gtacgagaaggacggccagacggtctacgggaccgacttcattgccgataaggtggattatctggacaccaaggcaccaggcgggtcaaatcaggaataagggcac attgccccggcgtgagtcggggcaatcccgcaaggagggtgaatgaatcggacgtttgaccggaaggcatacaggcaagaactgatcgacgcggggttttccgccg aggatgccgaaaccatcgcaagccgcaccgtcatgcgtgcgccccgcgaaaccttccagtccgtcggctcgatggtccagcaagctacggccaagatcgagcgcg acagcgtgcaactggctccccctgccctgcccgcgccatcggccgccgtggagcgttcgcgtcgtctcgaacaggaggcggcaggtttggcgaagtcgatgaccat cgacacgcgaggaactatgacgaccaagaagcgaaaaaccgccggcgaggacctggcaaaacaggtcagcgaggccaagcaggccgcgttgctgaaacacacg aagcagcagatcaaggaaatgcagctttccttgttcgatattgcgccgtggccggacacgatgcgagcgatgccaaacgacacggcccgctctgccctgttcaccacg cgcaacaagaaaatcccgcgcgaggcgctgcaaaacaaggtcattttccacgtcaacaaggacgtgaagatcacctacaccggcgtcgagctgcgggccgacgatg acgaactggtgtggcagcaggtgttggagtacgcgaagcgcacccctatcggcgagccgatcaccttcacgttctacgagctttgccaggacctgggctggtcgatca atggccggtattacacgaaggccgaggaatgcctgtcgcgcctacaggcgacggcgatgggcttcacgtccgaccgcgttgggcacctggaatcggtgtcgctgctg caccgcttccgcgtcctggaccgtggcaagaaaacgtcccgttgccaggtcctgatcgacgaggaaatcgtcgtgctgtttgctggcgaccactacacgaaattcatat gggagaagtaccgcaagctgtcgccgacggcccgacggatgttcgactatttcagctcgcaccgggagccgtacccgctcaagctggaaaccttccgcctcatgtgc 
ggatcggattccacccgcgtgaagaagtggcgcgagcaggtcggcgaagcctgcgaagagttgcgaggcagcggcctggtggaacacgcctgggtcaatgatgac ctggtgcattgcaaacgctagggccttgtggggtcagttccggctgggggttcagcagccagcgctttactggcatttcaggaacaagcgggcactgctcgacgcactt gcttcgctcagtatcgctcgggacgcacggcgcgctctacgaactgccgatagacaactgtcacggttaagcgagaaatgaataagaaggctgataattcggatctctg cgagggagatgatatttgatccggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcgctcggt cgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaa aaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc gacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctccttcgggaag cgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgcctt atccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggaacaggattagcagagcgaggtatgtaggcggtgctaca gagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccg gcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatatcaagaagatcctttgatcttttctacggggtctgagc ctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaattaaaaatgaagttttaaatcaatctaaagtatata tgagtaaacttggtctgacagttaccaatgcttcatcagtgaggctgatcacggcagcaacgctctgtcatcgttacaatcaacatgcaccctccgcgagatcatccgt gtttcaaacccggcagcttagttgccgttcttccgaatagcatcggtaacatgagcaaagtctgccgccttacaacggctctcccgctgacgccgtcccggactgatggg ctgcctgtatcgagtggtgattttgtgccgagctgccggtcggggagctgttggctggctggtggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaata acacaccgcggtctagaactagtggatcccccctacgtgcgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgttttctatcgc gtattaaatgtataattgcgggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaattatatgataat catcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatccctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcg 
aatcgggagcggcgataccgtaaagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtcc gccacacccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgagatcctcgccgtc gggcatgcgcgccttgagcctggcgaaccagttcggctgcgcgagcccctgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccggtacgtgct cgctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcaggagcaa ggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtg gccagccacgatagccgcgctgcctcgtcctggagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaaca cggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatctgt taatcagaaaaactcagattaatcgacaaattcgatcgcacaaaactagaaactaacaccagatctagatagaaatcacaaatcgaagagtaattattcgacaaaactcaa attatttgaacaaatcggatgatatttatgaaaccctaatcgagaattaagatgatatctaacgatcaaacccagaaaatcgtcttcgatctaagattaacagaatctaaacca aagaacatatacgaaattgggatcgaacgaaaacaaaatcgaagattttgagagaataaggaacacagaaatttaccttgatcacggtagagagaattgagagaaagtt tttaagattttgagaaattgaaatctgaattgtgaagaagaagagctctttgggtattgttttatagaagaagaagaagaaaagacgaggacgactaggtcacgagaaagc taaggcggtgaagcaatagctaataataaaatgacacgtgtattgagcgttgtttacacgcaaagtgtttttggctaattgccttatttttaggttgaggaaaagtatttgtgct ttgagttgataaaacacgactcgtgtgtgccggctgcaaccactttgacgccgtttattactgactcgtcgacaaccacaatttctaacggtcgtcataagatccagccgttga gatttaacgatcgttacgatttatatttttttagcattatcgttttattttttaaatatacggtggagctgaaaattggcaataattgaaccgtgggtcccactgcattgaagcgtatt tcgtattttctagaattcttcgtgctttatttcttttcctttttgttttttttgccatttatctaatgcaagtgggcttataaaatcagtgaatttcttggaaaagtaacttcttatcgt ataacatattgtgaaattatccatttcttttaattttttagtgttattggatatttttgtatgattattgatttgcataggataatgacttttgtatcaagttggtgaacaagtctcgtt aaaaaaggcaagtggtttggtgactcgatttattcttgttatttaattcatatatcaatggatcttatttggggcctggtccatatttaacactcgtgtgtcagtccaatgaccaataat 
attttttcattaataacaatgtaacaagaatgatacacaaaacattctttgaataagttcgctatgaagaagggaacttatccggtcctagatcatcagttcatacaaacctccataga gttcaacatcttaaacaaggatatcctgatccgttgacggcgcgccttcccgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgt tttctatcgcgtattaaatgtataattgcgggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaatt atatgataatcatcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatcggggaaattcgagctcaaagtgcaattgaccgatcaga gtttgaagaaaaatttattacacactttatgtaaagctgaaaaaaacggcctcccgcagggaagccgtttttttcgttatctgatttttgtaaaggtctgataatggtccgttgtt tgtaaatcagccagtcgcttgagtaaagaatccggtctgaatttctgaagcctgatgtatagttaatatccgctccacgccatgttcgtccgcttttgcccgggagtttgcctt ccctgtttgagaagatgtctccgccgatgcttttccccggagcgacgtctgcaaggttcccttttgatgccacccagccgagggcttgtgcttctgattttgtaatgtaattat caggtagcttatgatatgtctgaagataatccgcaaccccgtcaaacgtgttgataacctgtgccatgatttgtacacaaaatttccgcgcacagatcctcacagcgtatgc aaaacaaagctgcaactactaataccagtccaaaagcaatgggcgcaacagcaacagcaaaagctgcaaccccttgtgctggttcgttcctacagttggacgcagccc gagttctgagaaacaaataaccacaaggcaagttaggtaccaaaccccttaagctcaacttaagcaaatattacaatcgtttgtttctacaaatctttttcagaacggc ttcaggtggggaatattgtccatttaagtacctgaaaatctaagaacacggccaatccgggcgcctttgcttgaaagtgggaagaaacctgaatgattgaacagtggataa gagatttataagcaagattagcagggctgatcagattgttttttcgggtaggttgatcaatacatatgccccttccctcttcctttcctctacaatcgattgccagggagagata gagataccatcatgatgatgatggtggggatggcgatgatggtaatgatgatgatccagcagaaaaaattgcgcagaagaagaagatgagcggtcggtcggtcgatag cctttcagtcggaggggaaagaacaaaataatgcctatttgaaggcagatggattgactaagacgtgtgcaggcagtggaggagttacaaggcaggacatatttactag gtataggtgtaggtaatagtaatggagaggataaatttaggttttgggatgaatggatttgttggtacatgttgcaactcccacactgcaatcaaaggaccgctatgacacc ccctgaatgcgacgcccatgagaatgccgaccccacatatacatttctggaaataatagggaaatgcacccttgcattatatttcatttattcgtcctccattttgtgcgctctc cattcattttcaaatgcgctccactcttcctttatttcttaccaccattatctcgtattcgaggtccagaaatcaagttgtgaatctgccttggttgcgcattgttaaagtactcttct 
gtgtatatttctgccccaccgttttcacttccaacacttaaatttttttattttttattttatatatttcttataaattgttggcttctcacacgaacccaagccatccaagcccgaca aaggcaatccaatgtacttgactagagtcaaataccttttacttctttacttctcatattacccagaagccaagccaaccttaccaaactaatgtacctgagcagagtccacta cctttcctcaagtacagtggcagtcagagtatatcaccgcttgttatgtatatgctttaatgctatgcttatttctaggtcataatctaaatcatatttgctgtcgagtttaagcttat cgataccgtcgacctcgagcttcttcttgaatgctcttatgggtaggattatttttcacttttttccttcatattccacacacatatatatataaacacactaacattagtgggaata tttgtttgatatgtttattttatttacttcgggggtttttgtaaaattttgtagatctaattcttgttcttcatgtgtatattaattttcccttaagacttaaataaaaagagagagttt gttatatatagatatatgaagtgagggaaatggtacaaagttaaaggagatctgagtgagagttagataataaatgaaaagaaataagaaaccatcagggttttttctaatgtgg agttttagattcagttttgtagaactaagattcactttgttgggtgttctttcttcactcatttctgttattataataataataaaatcttatatctttctattttccttactaacaagt acttgaagatttagatatatttatagatctggtgttgtaataggtaaaaacttgatttttatgactataaaagtaagttttgggaaacaaattggggagagagtaaggaaggactatg aggtcatatcttctgttttgtgatcatccatcctccattgttgttaatgtctgtgtctctctttttcttctcttctttctcttactttcctttcttatctctagctctctttctctctca tgaattatatcatatcatatatttgatacaaacacatgtgatggtaagtgagagtgaataaggtgaactagctagatttttgagttttcatgaaattttaacttatatgagtgatagaaa ataatggaacttatacgtacatgtaggacaatttagatggttatctaagtttttgtttttgttttctcttgagaatgttaaagttagtgttatttttgtagttttggaaaattatatatg agctaagattagtttagaagtggtcaaaagaaacatagatttgaaatttcaactgaattttcaagatttcaaatagtcaatgaaacaaggaggtaattaagacaaattagcttatggg gactcttttttgttattccttaaaattactctttttaaaattaaaaataactaatctcatttcgaactacattactcaaactagtaatctctaattcgacacgcaatttccaaatactta ttagtagagagtcccacgtgattactttcttcccaccaaaacataaaacatgtcaagattaaatggtgtttgaaaattaaaagatcaattttcttaatcgtttacagttgtcaact ctcatgtcctgaaatatataattctcatgtccaaaacaagaaaagctaacaacgacttcaaattaaatcagtcaatcaaaattagtcttcattacctactaatttctttttatatat ccgatgggtactctacgaaatcagagtttcgtttctttatttattttcttttataagatttttgaggttttttcagaggttggaattgagcgcaagattaggttttgggtctgtaagatt 
tgttgtctttgttaaagaatctttgatcacgtcatcactcagatattatttctttttatttttcatttgtatttttactaatttattataaagttttgttagtttcagttcttgacttct gacaagaaggttttatgtcataatgaattaatttgtaacctatttataaattcaaaaatgtcatcatattactacttttgaccatttaatattagatttctcatttggtcaatacccaat gttcatattacatatatagagacaaaaattataaggatactaaattgttcatatttcttggaagtaaaaagattaatgatcactgaataaatagatttggcatagaagtatagcattgga attgcttcaacatctttggtgtagatagatttatgcaatttctctttctttttgaagtatctttttttttctagagagagaataatgttagggatttttatcattttctctctattatgg gtactgagaggaaagtgagatttttagtacggatccaatagttaagagtttggtctgccttctacgatccaaaaaaatctacggtcatgatctctccatcgagaaggttgagagttc agacatcaaagtctataatatgtcattgtaatacgtatttgtgcatatatatctatgtacaagtacatatacaggaaactcaagaaaaaagaataaatggtaaatttaattatatt ccaaataaggaaagtatggaacgttgtgatgttactcggacaagtcatttagttacatccatcacgtttaaatttaatccaatggttacaattttaatactatcaaatgtctattgg atttatacccaatgtgttaatgggttgttgacacatgtcacatgtctgaaaccctagacatgttcagaccaatcatgtcactctaattttgccagcatggcagttggcagccaa tcactagctcgataaatttaaggtttcagaggaattttaatttatttagggttcatattgtttcataaaatgattctttatttgttacaactttaaggaaatattttattaactatttaa ttgttcccttttcttatattactttgttttttcttcacatcatgtgtcacattaagttgcatttcttctgactcaaaagaaccgatgtttgcttttaaggtttcgtattagaatcactta actgtgcaagtggtcgatttgaccctatcaagcttgatatcgaattgcggccgcatttgggctcctgcaggtaccttaattaaaagtttaaactatcagtgtttgacaggatatattgg cgggtaaacctaagagaaaagagcgtttattagaataatcggatatttaaaagggcgtgaaaaggtttatccgttcgtccatttgtatgtgcatgccaaccacagggttccc cagatc 

1. A DNA construct comprising a promoter operably linked to a first DNA segment that corresponds to at least a portion of a gene in the monolignol biosynthetic pathway, a spacer DNA segment, comprising a nucleotide sequence having SEQ ID NO: 33 or a fragment thereof which is at least 50 bp long of a gene having the sequence of SEQ ID NO: 65, wherein said gene is involved in the monolignol biosynthetic pathway, an intron spacer DNA segment, and a second DNA segment that is fully complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, in the DNA construct.
 2. The DNA construct of claim 1 wherein said gene involved in the monolignol biosynthetic pathway is 4CL.
 3. The DNA construct of claim 1 wherein said promoter is a constitutive promoter.
 4. The DNA construct of claim 1 wherein said promoter is a tissue-specific promoter.
 5. The DNA construct of claim 4 wherein said promoter directs expression in a vascular-preferred manner such that expression is found in the xylem of plants.
 6. The DNA construct of claim 1 wherein said promoter is a 4CL promoter from P. taeda.
 7. The DNA construct of claim 1 wherein said portion of said gene has a fragment length selected from the group consisting of about 50 bp, 100 bp, 200 bp, 400 bp, 600 bp and 1000 bp.
 8. The DNA construct of claim 1 wherein said portion of said gene has a fragment length of about 200 bp.
 9. The DNA construct of claim 1 wherein said portion of said gene has a fragment length of about 334 bp.
 10. The DNA construct of claim 1 wherein said gene in the monolignol biosynthetic pathway is a 4CL gene.
 11. The DNA construct of claim 10 wherein said portion of said gene is selected from the group consisting of SEQ ID NOS. 18, 19, 20, 21, 22, 23, 24 and 48.
 12. The DNA construct of claim 10 wherein said portion of said gene comprises the nucleotide sequence of SEQ ID NO: 18.
 13. The DNA construct of claim 10 wherein said portion of said gene comprises the nucleotide sequence of SEQ ID NO: 23.
 14. The DNA construct of claim 10 wherein said portion of said gene comprises the nucleotide sequence of SEQ ID NO: 33.
 15. The DNA construct of claim 1 wherein said spacer DNA segment is at least a portion of an intron.
 16. The DNA construct of claim 1 wherein said spacer DNA segment comprises a nucleotide sequence selected from the group consisting of SEQ ID NO: 9, SEQ ID NO: 15 and SEQ ID NO: 64.
 17. The DNA construct of claim 10 wherein said promoter is a vascular-preferred promoter and said first DNA segment comprises the nucleotide sequence of SEQ ID NO: 33.
 18. The DNA construct of claim 10 wherein said promoter is a vascular-preferred promoter and said first DNA segment comprises the nucleotide sequence of SEQ ID NO: 18.
 19. The DNA construct of claim 10 wherein said promoter is a vascular-preferred promoter and said first DNA segment comprises the nucleotide sequence of SEQ ID NO: 23.
 20. The DNA construct of claim 10 wherein said DNA construct is selected from the group consisting of pARB345, pWVK158, pWVK154, pWVK143, pWVC46, pWVC40 and pWVC44.
 21. The DNA construct of claim 1 further comprising at least one T-DNA border.
 22. A DNA construct comprising a promoter operably linked to a first DNA segment that corresponds to at least a portion of a LIM gene, a spacer DNA segment, and a second DNA segment that is complementary to the first DNA segment, wherein the first and second DNA segments are arranged in a 5′ to 3′ direction, respectively, in the DNA construct.
 23. A method of modulating the expression of lignin in a plant comprising introducing into said plant the DNA construct of claim 1 and growing said plant.
 24. A method of inhibiting the expression of lignin in a plant comprising introducing into said plant the DNA construct of claim 1 and growing said plant.
 25. A method of reducing the expression of lignin in a plant comprising introducing into said plant the DNA construct of claim 1 and growing said plant.
 26. A plant comprising the DNA construct of claim 1.
 27. A plant cell comprising the DNA construct of claim 1.
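
As an illustration of the arrangement recited in claim 1 (a first DNA segment, a spacer DNA segment, and a complementary second DNA segment), the following Python sketch assembles such an insert in silico. It assumes that "complementary" means the reverse complement of the first segment placed on the same strand, so that a transcript of the insert could fold back on itself. That reading, the function names, and the short placeholder sequences are assumptions made for illustration only; none of them is taken from the sequence listing.

```python
# Minimal sketch of a first-segment / spacer / second-segment arrangement.
# All names and sequences here are hypothetical placeholders, not SEQ ID NOs.

# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def build_hairpin_insert(first_segment: str, spacer: str) -> str:
    """Join first segment, spacer, and the reverse complement of the first segment.

    Assumption: the second segment is the reverse complement of the first,
    read 5' to 3' on the same strand.
    """
    second_segment = reverse_complement(first_segment)
    return first_segment + spacer + second_segment


def is_inverted_repeat(insert: str, arm_length: int) -> bool:
    """Check that the last arm_length bases are the reverse complement of the first."""
    left_arm = insert[:arm_length]
    right_arm = insert[-arm_length:]
    return reverse_complement(left_arm) == right_arm


# Placeholder sequences chosen only to make the example runnable.
first = "atggctaaaatgagaatatc"
spacer = "gtaagtttctgcag"
insert = build_hairpin_insert(first, spacer)
```

A check such as `is_inverted_repeat(insert, len(first))` can be applied to any candidate insert to confirm that its two arms are complementary over the chosen length, whatever spacer lies between them.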